Getting ready for Big Data with Data Services Webcast – Part 1
SAP’s Ina Felsheim and Paul Medailles presented this webcast this past week.
Figure 1: Source: SAP
Figure 1 covers the “3 V’s” of Big Data. Organizations are dealing with larger and larger datasets.
Volume refers to the sheer amount of data.
Variety of data includes structured and unstructured data.
Velocity covers how fast data arrives and needs to be processed.
A fourth V, value, brings both challenges and opportunities.
Figure 2: Source: SAP
The majority of businesses deal with less than 100 TB of data today.
Most companies expect their data requirements to grow.
TDWI expects datasets to be larger than 100 TB by 2015.
Figure 3: Source: SAP
Figure 3 shows that rather than running a predefined query, you are mining the data without knowing in advance what you will find.
You want tools that can search the data and deliver insights.
Figure 4: Source: SAP
Figure 4 shows the use case for real time analysis.
On the left, analysis happens only at the end of the business process, with latency: the data is extracted into the enterprise data warehouse and only then analyzed.
With the tools we have today, we can do real-time analysis (on the right of Figure 4). You can track a process and analyze it while it is still occurring.
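The difference between the two sides of Figure 4 can be illustrated with a minimal Python sketch. The event values and function names below are hypothetical and are not taken from the webcast or from any SAP product; the sketch only contrasts computing a result after extraction with updating it as each event arrives.

```python
from typing import Iterable, Iterator

# Hypothetical order amounts produced by a business process, in arrival order.
events = [120.0, 80.0, 200.0]

def batch_total(extracted: Iterable[float]) -> float:
    """Batch style: analyze only after all data has been extracted."""
    return sum(extracted)

def running_totals(stream: Iterable[float]) -> Iterator[float]:
    """Real-time style: update the result as each event arrives."""
    total = 0.0
    for amount in stream:
        total += amount
        yield total

print(batch_total(events))           # one answer, available at the end
print(list(running_totals(events)))  # an answer after every event
```

Both approaches reach the same final total; the real-time version simply makes intermediate results available while the process is still running.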
Figure 5: Source: SAP
Figure 5 shows Data Services, which provides ETL (extract, transform, load) at the data layer, not the process layer.
It also has data quality management capabilities.
The third capability is text data processing.
Figure 6: Source: SAP
Data Services is not completely SAP-centric, as shown in Figure 6.
Data profiling covers understanding the quality of the data.
The core data services engine includes jobs for data cleansing, enrichment and validation.
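To make profiling, cleansing and validation concrete, here is a small Python sketch. It is not Data Services code; the records, field names and the email rule are assumptions chosen only for illustration, and enrichment is left out.

```python
import re

# Hypothetical customer records; field names are illustrative only.
records = [
    {"name": " Alice ", "email": "alice@example.com"},
    {"name": "bob", "email": None},
    {"name": "Carol", "email": "carol-at-example"},
]

def profile(rows, field):
    """Profiling: count missing and distinct values for one field."""
    values = [r[field] for r in rows]
    present = [v for v in values if v is not None]
    return {"missing": len(values) - len(present), "distinct": len(set(present))}

def cleanse(row):
    """Cleansing: trim whitespace and normalize name capitalization."""
    return {**row, "name": row["name"].strip().title()}

# A deliberately simple pattern; real email validation is more involved.
EMAIL = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")

def is_valid(row):
    """Validation: require a syntactically plausible email address."""
    return row["email"] is not None and EMAIL.match(row["email"]) is not None

cleaned = [cleanse(r) for r in records]
valid = [r for r in cleaned if is_valid(r)]
print(profile(records, "email"))
print([r["name"] for r in valid])
```

Profiling runs first because it tells you which cleansing and validation rules are worth writing.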
Figure 7: Source: SAP
Figure 7 shows a new format called VCF (Variant Call Format), used for gene sequencing data.
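For readers who have not seen VCF, each data line is tab-separated and starts with eight fixed columns (CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO). The Python sketch below parses one such line. The sample line and the `parse_vcf_line` helper are illustrative assumptions; this is not how Data Services reads VCF, and it ignores header lines, sample columns and missing-value markers.

```python
# A single hypothetical VCF data line; the eight fixed columns are tab-separated.
line = "20\t14370\trs6054257\tG\tA\t29\tPASS\tNS=3;DP=14;DB"

COLUMNS = ["CHROM", "POS", "ID", "REF", "ALT", "QUAL", "FILTER", "INFO"]

def parse_vcf_line(text):
    """Split one data line into its fixed columns and expand the INFO field."""
    fields = text.rstrip("\n").split("\t")
    record = dict(zip(COLUMNS, fields[:8]))
    record["POS"] = int(record["POS"])
    # INFO holds semicolon-separated key=value pairs; bare keys are flags.
    info = {}
    for item in record["INFO"].split(";"):
        key, _, value = item.partition("=")
        info[key] = value if value else True
    record["INFO"] = info
    return record

record = parse_vcf_line(line)
print(record["CHROM"], record["POS"], record["INFO"])
```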
Hadoop is supported as a source or target.
Data Services 4.2 has a new adapter SDK that lets you build connectivity to any back-end system as a source or target.
Change data capture can be added as well.
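Change data capture means moving only the rows that changed since the last run instead of reloading everything. The Python sketch below shows one simple way to do it, by comparing two snapshots keyed by primary key. It is an assumption made for illustration, not a description of how Data Services implements the feature; production systems often read database logs or timestamps instead.

```python
def capture_changes(previous, current):
    """Compare two snapshots keyed by primary key and report what changed."""
    inserts = {k: current[k] for k in current.keys() - previous.keys()}
    deletes = {k: previous[k] for k in previous.keys() - current.keys()}
    updates = {
        k: current[k]
        for k in current.keys() & previous.keys()
        if current[k] != previous[k]
    }
    return {"insert": inserts, "update": updates, "delete": deletes}

# Hypothetical snapshots of a customer table: primary key -> name.
previous = {1: "Alice", 2: "Bob", 3: "Carol"}
current = {1: "Alice", 2: "Robert", 4: "Dave"}

changes = capture_changes(previous, current)
print(changes)
```

Only the three changed rows would need to be sent to the target; the unchanged row for key 1 is skipped.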
To be continued – slides for the webcast are here: http://public.brighttalk.com/resource/core/28131/navigating-big-data-with-sap-data-services_42469.pdf
Subset of Question & Answer
Q: What skillsets are needed for Data Services?
A: It depends on the use case.
Data migration requires a certain set of skills.
Developing extract, transform, load jobs requires a SQL developer.
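To show why SQL skills matter for this kind of work, here is a minimal extract, transform, load sketch using Python's built-in `sqlite3` module. The table names (`src_orders`, `dw_sales`) and the data are hypothetical, and Data Services jobs are built in its own design tool rather than written this way; the sketch only illustrates the SQL thinking involved.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE src_orders (id INTEGER, customer TEXT, amount REAL);
    CREATE TABLE dw_sales (customer TEXT PRIMARY KEY, total REAL);
    INSERT INTO src_orders VALUES
        (1, ' alice', 10.0), (2, 'ALICE ', 5.5), (3, 'bob', 7.0);
""")

# Extract from the source table, transform (trim, lowercase, aggregate),
# and load the result into the warehouse table in one statement.
conn.execute("""
    INSERT INTO dw_sales (customer, total)
    SELECT LOWER(TRIM(customer)), SUM(amount)
    FROM src_orders
    GROUP BY LOWER(TRIM(customer))
""")
conn.commit()

rows = conn.execute(
    "SELECT customer, total FROM dw_sales ORDER BY customer"
).fetchall()
print(rows)
```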
Q: How is it licensed? Is it separate from BI?
A: It is a separate license from BI; licensed by cores
Q: Do you have to be an SAP customer to take advantage of Data Services?
A: No; many customers who do not otherwise run SAP still use Data Services.