Skip to Content

ASUG News Craig Powers wrote about it here: SAP Looking to Go Big on Self-Service Analytics with Hadoop and Lumira – ASUG News

Below are my notes (Part 1):

SAP’s Paul Ekeland provided this ASUG webcast last week.  Please note the usual legal disclaimer applies that things in the future are subject to change.

/wp-content/uploads/2015/04/1fig_695434.jpg

Figure 1: Source: SAP

Big Data is popular; this is explained by 2 factors – cost effective way to store information

Hadoop allows you to store data on commodity software

It is not just about cost; also do not have to think ahead of how shape information in Hadoop system

Think about how you are going to use it

/wp-content/uploads/2015/04/2fig_695435.jpg

Figure 2: Source: SAP

Hadoop is spacious but slow like a bus;  HANA is like “racing cars” in terms of speed

/wp-content/uploads/2015/04/3fig_695436.jpg

Figure 3: Source: SAP

Figure 3 covers “Hub and spoke”, storing data in a data lake

Put data marts or enterprise data warehouse on top of it so it would extract it and stage where you plug BI apps

If you access directly in data lake can be slow

Data exploration possibilities include exploring and try to figure out high level information

Hub and spoke architecture is becoming the standard

/wp-content/uploads/2015/04/4fig_695437.jpg

Figure 4: Source: SAP

Hive provides SQL access to data in HDFS

Oozie workflow allows you to schedule jobs in Hadoop

Hadoop system is open source

/wp-content/uploads/2015/04/5fig_695438.jpg

Figure 5: Source: SAP

Figure 5 covers how HANA & Hadoop work together

Hadoop – investigate; once know you what you want to extract, push data to HANA; it will operationalize information like “no other tool”

/wp-content/uploads/2015/04/6fig_695439.jpg

Figure 6: Source: SAP

Companies have “mountains” of information

Linked in shows 18% of jobs are related to data

There is a talent gap in the market

/wp-content/uploads/2015/04/7fig_695440.jpg

Figure 7: Source: SAP

Lumira addresses visuals

As soon as you want to understand which part will break, need predictive.    Two tools “play nicely together”

/wp-content/uploads/2015/04/8fig_695441.jpg

Figure 8: Source: SAP

Both BI and Lumira share the same datasources

Future includes SparkSQL, MongoDB, Graphs

Data access extensions are available for Lumira


SAP has partnerships with vendors such as Cloudera, Hortonworks, MapR to ensure they work

/wp-content/uploads/2015/04/9fig_695442.jpg

Figure 9: Source: SAP

Machine sensors send event every second

Hadoop is slow; its level of SQL is limited

/wp-content/uploads/2015/04/10fig_695443.jpg

Figure 10: Source: SAP

Data prep is in Lumira; then schedule jobs via Ooozie to generate full dataset

Load data to Lumira, share via various flavors of Lumira

In HANA connect via Smart Data Access

You can use Impala or HIVE driver

/wp-content/uploads/2015/04/11fig_695444.jpg

Figure 11: Source: SAP

Generate table automatically so visualizations created will re-point to virtual table

/wp-content/uploads/2015/04/12fig_695445.jpg

Figure 12: Source: SAP

Figure 12 shows the planned self service on Hadoop, starting with sampling the data, scheduling to generate the dataset, Hadoop to access visualizations, publishing to Lumira Server

/wp-content/uploads/2015/04/13fig_695446.jpg

Figure 13: Source: SAP

Figure 13 shows planned deliverables

Part 2 is coming when I have time.

What do you think of SAP Lumira and Big Data?

To report this post you need to login first.

Be the first to leave a comment

You must be Logged on to comment or reply to a post.

Leave a Reply