SAP HANA Vora: Graphical Modelling tool basic exam...

Technology Blogs by Members

Explore a vibrant mix of technical expertise, industry insights, and tech buzz in member blogs covering SAP products, technology, and events. Get in the mix!

SAP Hana Vora is a 'Big Data' In-memory reporting engine sitting on top of an Hadoop Cluster.

Data can be loaded into the Hadoop Cluster memory from multiple source e.g. HANA, The Hadoop File system (HDFS), remote files systems like AWS S3

With the release of SAP Hana Vora 1.2 it's now possible to graphically model views (e.g. joining multiple datasets) similar to a Hana calculation view.

The following link has all the details to get you started with Vora SAP HANA Vora - Troubleshooting

This blog contains a very basic introductory example of using the new graphical modelling tool.

The steps are:

Create 2 example datasets in HDFS, using scala and spark
Create Vora tables, linked to these files
Model a view joining these tables, and filtering on key elements

Firstly the following 2 datasets need to be created for transactional and master data (reporting attributes).

Transactional Data

COMPANYCODE	ACCOUTGROUP	AMOUNT_USD
AU01	Revenue	300.0
GB01	Revenue	1,000.0
US01	Revenue	5,000.0
US01	Expense	-3,000.0
US02	Revenue	700.0

Master Data

COMPANYCODE	DESCRIPTION	COUNTRY
AU01	Australia 1	AU
GB01	United Kingdom 1	UK
US01	United States of America 1	US
US02	United States of America 2	US

In the following steps open source Zeppelin is used to interact with Vora, Spark and HDFS.

Open Zeppelin and create a new notebook.

Next create the sample data using Spark and Scala.

Create sample Company Data and save to HDFS
fs.delete(new Path("/user/vora/zeptest/companyData"), true) val companyDataDF = Seq( ("GB01","Revenue", 1000.00), ("US01","Revenue", 5000.00), ("US01","Expense",-3000.00), ("US02","Revenue", 700.00), ("AU01","Revenue", 300.00)).toDF("Company","AccountGroup","Amount_USD") companyDataDF.repartition(1).save("/user/vora/zeptest/companyData", "parquet")

Create sample Company Data and save to HDFS

fs.delete(new Path("/user/vora/zeptest/companyData"), true)

val companyDataDF = Seq(

("GB01","Revenue", 1000.00),

("US01","Revenue", 5000.00),

("US01","Expense",-3000.00),

("US02","Revenue", 700.00),

("AU01","Revenue", 300.00)).toDF("Company","AccountGroup","Amount_USD")

companyDataDF.repartition(1).save("/user/vora/zeptest/companyData", "parquet")

Create sample Company Master Data and save to HDFS
fs.delete(new Path("/user/vora/zeptest/companyAttr"), true) val companyAttrDF = Seq( ("GB01","United Kingdom 1", "UK"), ("US01","United States of America 1", "US"), ("US02","United States of America 2", "US"), ("AU01","Australia 1", "AU")).toDF("Company","Description", "Country") companyAttrDF.repartition(1).save("/user/vora/zeptest/companyAttr", "parquet")

Create sample Company Master Data and save to HDFS

fs.delete(new Path("/user/vora/zeptest/companyAttr"), true)

val companyAttrDF = Seq(

("GB01","United Kingdom 1", "UK"),

("US01","United States of America 1", "US"),

("US02","United States of America 2", "US"),

("AU01","Australia 1", "AU")).toDF("Company","Description", "Country")

companyAttrDF.repartition(1).save("/user/vora/zeptest/companyAttr", "parquet")

Lets now check in HDFS that the directories/files have been created

Directory listing in HDFS
import org.apache.hadoop.fs.FileSystem import org.apache.hadoop.fs.Path val fs = FileSystem.get(sc.hadoopConfiguration) var status = fs.listStatus(new Path("/user/vora/zeptest")) status.foreach(x=> println(x.getPath))

Directory listing in HDFS

import org.apache.hadoop.fs.FileSystem

import org.apache.hadoop.fs.Path

val fs = FileSystem.get(sc.hadoopConfiguration)

var status = fs.listStatus(new Path("/user/vora/zeptest"))

status.foreach(x=> println(x.getPath))

Next use the %vora option in Zeppelin to create the Vora tables

Create the Vora Tables
%vora CREATE TABLE COMPANYDATA( COMPANYCODE VARCHAR(4), ACCOUNTGROUP VARCHAR(10), AMOUNT_USD DOUBLE ) USING com.sap.spark.vora OPTIONS ( tableName "COMPANYDATA", paths "/user/vora/zeptest/companyData/*", format "parquet" )
%vora CREATE TABLE COMPANYATTR( COMPANYCODE VARCHAR(4), DESCRIPTION VARCHAR(50), COUNTRY VARCHAR(2) ) USING com.sap.spark.vora OPTIONS ( tableName "COMPANYATTR", paths "/user/vora/zeptest/companyAttr/*", format "parquet" )

Create the Vora Tables

%vora CREATE TABLE COMPANYDATA(

COMPANYCODE VARCHAR(4),

ACCOUNTGROUP VARCHAR(10),

AMOUNT_USD DOUBLE

)

USING com.sap.spark.vora

OPTIONS (

tableName "COMPANYDATA",

paths "/user/vora/zeptest/companyData/*",

format "parquet"

)