SAP VORA Modeling
What is SAP VORA?
SAP Vora is an in-memory query engine, which allows big data to access from the Hadoop. It uses HANA data to process real time data. Vora runs on commodity hardware with accessing huge amount of data with low cost and can integrate with enterprise data.
Modeling allows user to create and modify the data through the web based GUI interaction. Modeling allows user to create, modify and cleanse the data by click event. Modeling allows to create tables and views on different data formats as like the SQL queries with its rich set of data types.
Modeling allows below features to use:
- Allow to create the table with its rich set of data types.
- Allow to create table with different file formats (csv, orc, parquet…)
- Once creating table, it allows to append another table to the existing table.
- Table can be created using the different sources like HANA, Hadoop, S3.
HANA Data Processing:
Using Vora we can connect to the Hana and access the Hana data to the Vora and combining with the big data in Hadoop using spark integration. Once cleansing the data.
When data is huge and unable to fit in to the In-memory during processing, then you can use Vora Disk engine feature, where you create table in the Disk engine and processes data into distributed query processing system.
As Disk engine is integrated with Spark SQL, so data can be accessed as like In-memory tables.
While creating the table in the disk engine it uses as ‘USING com.sap.spark.engines.disk’ at the backend to create table in the disk engine.
Time Series Tables:
As like normal tables, Vora allow us to create time series tables, which is based on the time data or based on the regular time intervals of data. This engine will do data compression in in-memory. This engine take care of correlation and trends in the data. You can create partition and store the data.
This Time Series Table is created using the ‘USING com.sap.spark.engine’ at the backend.
Once Time series table is created, that table can be accessed using the SQL or through the modeler and get the information based on appropriate time.
As like other view, SAP Vora also allow to create views using the data source or sources in the web based GUI form. Views are the virtual tables.
Vora allow to create view on top of the table or tables to extract meaningful information or data.
Using view, you can do the following:
Join: You can create view and add multiple tables to the view and create join to get the appropriated data between the two tables.
ResultSet: You can create Resultset between added tables to extract meaningful data.
Having and Group By clause: You can add Having clause to filter data based on the Having condition on added table and you can group the data based on the selected columns.
Where Condition: To restrict the data displaying based on the given Where condition.
SubSelect: It is kind of adding or creating view inside the another view and joining to get the meaningful information or data from the multiple data sources.