The Highlights from the SAP HANA Cloud Workshops

SeungjoonLee · 10-23-2020

Over the last three weeks, taesuk.son and I co-hosted SAP HANA Cloud Workshops with the following agenda.

Session 1: Access Data Across Your On-Premise and Cloud Data Sources

Session 2: Running queries with Federation and Import/Export

Session 3: A Single Gateway to All Your Data

All sessions are now available on demand, together with detailed workbooks you can follow. If you missed any of them or would like to watch them again, please refer to the link below.

SAP HANA Cloud Workshops

This blog also provides highlights of each session so that you can find individual topics quickly. Before starting the main topics, I would like to address again two of the most common questions from the workshops: “What are the differences between SAP HANA Cloud and SAP HANA Enterprise Cloud (HEC)?” and “Why is the hybrid scenario important?”

For the first question, SAP HANA Enterprise Cloud (HEC) is a managed service for hosting SAP applications, including the existing SAP HANA 2.0, platform edition, in the cloud (IaaS). SAP HANA Cloud is a new cloud-native product and managed database service that embraces Kubernetes (K8s), cloud elasticity, and so forth.

Regarding the second question, the hybrid scenario is growing in importance as enterprises can achieve flexibility and agility with private or public cloud to meet new and changing business demands while running mission-critical workloads in on-premise data centers.

With that, let me briefly recap each topic for using SAP HANA Cloud effectively from a hybrid scenario perspective. Please also refer to the timestamps for quickly navigating each topic in the on-demand recordings.

Session 1: Access Data Across Your On-Premise and Cloud Data Sources

In this session, you learn how to sign up for and activate an SAP HANA Cloud trial. Using your SAP HANA Cloud trial instance, the session then introduces how to enable hybrid scenarios with both SAP HANA smart data access (SDA) and SAP HANA smart data integration (SDI).

Both capabilities can be used for hybrid scenarios with SAP HANA Cloud, but at this point SDA only supports creating a remote source to SAP HANA Cloud from SAP HANA on-premise (SPS04 Rev45+). The reverse direction, creating a remote source to SAP HANA on-premise from SAP HANA Cloud with SDA, will be supported in Q4 2020. SDI can be used in both directions.

Provisioning your SAP HANA Cloud trial instance [00:00]
- This is the prerequisite for all sessions, as your SAP HANA Cloud trial instance is where all hands-on exercises are carried out

Creating a user and assigning roles [04:19]
- In this step, the privileges required for SDI/SDA are assigned

Creating an SDA connection from the SAP HANA on-premise to the SAP HANA Cloud [06:23]
- This step requires an SAP HANA on-premise system (this demo uses SAP HANA 2.0, express edition); this hybrid SDA connection is officially supported as of SAP HANA 2.0 SPS04 Rev45 or higher

Setting up the Data Provisioning Agent (DPAgent) on your SAP HANA on-premise [10:10]
- DPAgent is installed on the same Linux virtual machine where SAP HANA 2.0, express edition is installed

Creating a connection using the DPAgent configuration tool [13:10]
- The SAP HANA Cloud instance is connected via JDBC with SSL encryption

- Preparing a user with privileges and creating a source table in SAP HANA, express edition [19:31]
- Creating a remote source to SAP HANA, express edition from SAP HANA Cloud using SDI HanaAdapter, and creating a virtual table [21:11]
- Real-time replication with SDI from SAP HANA, express edition to SAP HANA Cloud [25:08]

Session 2: Running queries with Federation and Import/Export

The second session is about new capabilities with SAP HANA Cloud including importing data from Amazon S3, and data federation with Google BigQuery and SAP HANA Cloud, data lake.

For these exercises, Amazon S3, Google BigQuery and SAP HANA Cloud, data lake are already prepared for you. So, you can simply follow the steps with your SAP HANA Cloud trial instance as described in the workbook.

To access SAP HANA Cloud, data lake, the federation over federation concept is used: the data lake is reached via the other SAP HANA Cloud instance on top of it, because direct access to another instance's SAP HANA Cloud, data lake is not supported at this point. We also use TPC-H data and queries to explain the details.

Importing data from Amazon S3
- Preparations prior to the actual exercises (For reference only) [00:00]
- Creating and registering Amazon S3 Baltimore CyberTrust Root certificate, and importing TPC-H schema from Amazon S3 [03:04]
- Importing previously exported schema TPCH from Amazon S3 [05:17]
- Importing CSV files from Amazon S3 [09:54]
- Checking the imported data [12:34]

Data federation with Google BigQuery
- Importing a certificate for SSL connection to Google BigQuery remote source [13:41]
- Prepared data in Google BigQuery, and creating a key as a JSON file (For reference only) [17:42]
- Creating a remote source to Google BigQuery and creating a virtual table [21:39]
- Executing TPC-H query (Q2) with/without the virtual table [30:17]

Data federation with SAP HANA Cloud, data lake
- Prepared data in SAP HANA Cloud, data lake (For reference only) [35:13]
- Accessing SAP HANA Cloud, data lake via another SAP HANA Cloud instance by creating a remote source and creating a virtual table [37:56]
- Executing TPC-H query (Q1) with/without the layered virtual tables (federation over federation concept) [42:25]
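
As a rough illustration of the layered virtual tables mentioned above, the following Python sketch only builds the CREATE VIRTUAL TABLE statement text. It is not taken from the workbook: the helper names and all schema, table, and remote source names are hypothetical, and the "<NULL>" database part is an assumption that may differ for your remote source type.

```python
def quote_ident(name: str) -> str:
    """Quote a SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def create_virtual_table_sql(local_schema, virtual_table, remote_source,
                             remote_schema, remote_table,
                             remote_database="<NULL>"):
    """Build a CREATE VIRTUAL TABLE statement (all names are placeholders)."""
    target = ".".join(quote_ident(p) for p in (local_schema, virtual_table))
    source = ".".join(
        quote_ident(p)
        for p in (remote_source, remote_database, remote_schema, remote_table)
    )
    return f"CREATE VIRTUAL TABLE {target} AT {source}"


# Hypothetical example: the remote object is itself a virtual table in the
# other SAP HANA Cloud instance, which gives the "federation over federation".
print(create_virtual_table_sql(
    "WORKSHOP", "VT_LINEITEM", "HC_REMOTE", "TPCH", "VT_DL_LINEITEM"))
```

Because the remote object in this example is itself a virtual table, a query on the local virtual table passes through two federation layers before it reaches the data lake.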

Creating a consolidated view [47:00]
- In this step, for the purpose of wrapping up, a consolidated view is created by using TPC-H query (Q4)

Session 3: A Single Gateway to All Your Data

The last session covers another new capability of SAP HANA Cloud: toggling between a virtual table and a replica. We also revisit the federation over federation concept using both SAP HANA Cloud and SAP HANA on-premise (SAP HANA, express edition), along with the existing static result cache. Last but not least, working with the SQL Analyzer is also addressed.

Federation over federation (over federation) and hybrid queries
- In SAP HANA Cloud, creating an additional virtual table and executing TPC-H query (Q5) with/without virtual tables [00:00]
- Because these are on-the-fly federated queries, they can take a long time depending on the table sizes and the complexity of the queries
- In SAP HANA on-premise (SAP HANA, express edition), creating a remote source to SAP HANA Cloud, creating a virtual table, and executing some simple federated queries based on layered virtual tables [11:51]
- Checking the result in SAP HANA Cloud [18:55]
- Checking the result in SAP HANA on-premise (SAP HANA, express edition) [20:17]

Toggling between virtual table and replica [22:27]
- With SAP HANA Cloud, it is possible to switch between a virtual table and a replica
- Comparing performance between on-the-fly federated queries and queries with snapshot replicas in SAP HANA Cloud
- Comparing performance between on-the-fly queries with layered virtual tables and queries with layered virtual tables and a snapshot replica in SAP HANA on-premise (SAP HANA, express edition) [30:58]
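
The toggling described above comes down to ALTER VIRTUAL TABLE statements. The following is a minimal sketch that only generates the statement text, assuming the ADD SHARED SNAPSHOT REPLICA, ADD SHARED REPLICA, and DROP REPLICA clauses; the helper and the object names are hypothetical and not taken from the workbook.

```python
def quote_ident(name: str) -> str:
    """Quote a SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def toggle_replica_sql(schema, virtual_table, enable, snapshot=True):
    """Build the statement that adds or drops a replica on a virtual table.

    enable=True  -> add a snapshot replica (or a real-time replica if
                    snapshot=False)
    enable=False -> drop the replica, so queries are federated on the fly again
    """
    name = f"{quote_ident(schema)}.{quote_ident(virtual_table)}"
    if not enable:
        return f"ALTER VIRTUAL TABLE {name} DROP REPLICA"
    kind = "SHARED SNAPSHOT REPLICA" if snapshot else "SHARED REPLICA"
    return f"ALTER VIRTUAL TABLE {name} ADD {kind}"


# Hypothetical virtual table name, used for illustration only.
print(toggle_replica_sql("WORKSHOP", "VT_LINEITEM", enable=True))
print(toggle_replica_sql("WORKSHOP", "VT_LINEITEM", enable=False))
```

The virtual table keeps its name in both states, so the queries used for the performance comparison do not need to change when the replica is added or dropped.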

Using static result cache on a view with virtual table [34:16]
- Static result cache is available for both SAP HANA Cloud and SAP HANA on-premise
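
For orientation, a static result cache is typically enabled on a view with a retention period and then requested through a query hint. The sketch below only assembles the statement text, assuming the ALTER VIEW ... ADD CACHE RETENTION clause and the RESULT_CACHE hint; the helper functions, the view name, and the retention value are hypothetical, so please check the exact syntax for your release.

```python
def quote_ident(name: str) -> str:
    """Quote a SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def add_static_cache_sql(schema, view, retention_minutes):
    """Build the statement that enables a static result cache on a view."""
    if retention_minutes <= 0:
        raise ValueError("retention_minutes must be positive")
    name = f"{quote_ident(schema)}.{quote_ident(view)}"
    return f"ALTER VIEW {name} ADD CACHE RETENTION {retention_minutes}"


def cached_select_sql(schema, view):
    """Build a query that asks the optimizer to use the result cache."""
    name = f"{quote_ident(schema)}.{quote_ident(view)}"
    return f"SELECT * FROM {name} WITH HINT (RESULT_CACHE)"


# Hypothetical view built on top of a virtual table.
print(add_static_cache_sql("WORKSHOP", "V_ORDERS_SUMMARY", 30))
print(cached_select_sql("WORKSHOP", "V_ORDERS_SUMMARY"))
```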

Working with the SQL Analyzer
- Analyzing queries with Explain Plan and the join relocation concept [38:13]
- Visualizing Explain Plan [47:38]
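
Outside the SQL Analyzer UI, an Explain Plan can also be produced in plain SQL. The following sketch only generates the three statements usually involved (create the plan, read it from EXPLAIN_PLAN_TABLE, clean it up). The helper, the statement name, and the sample query are hypothetical, and the selected columns are just a small subset chosen for illustration.

```python
def explain_plan_statements(statement_name, query):
    """Return the SQL statements to create, read, and remove an explain plan."""
    safe = statement_name.replace("'", "''")  # escape single quotes in the name
    return [
        f"EXPLAIN PLAN SET STATEMENT_NAME = '{safe}' FOR {query}",
        "SELECT OPERATOR_NAME, OPERATOR_DETAILS, EXECUTION_ENGINE "
        "FROM EXPLAIN_PLAN_TABLE "
        f"WHERE STATEMENT_NAME = '{safe}' ORDER BY OPERATOR_ID",
        f"DELETE FROM EXPLAIN_PLAN_TABLE WHERE STATEMENT_NAME = '{safe}'",
    ]


# Hypothetical federated query on a virtual table.
for stmt in explain_plan_statements(
        "federated_q", 'SELECT COUNT(*) FROM "WORKSHOP"."VT_LINEITEM"'):
    print(stmt + ";")
```

In the plan output, looking at where the remote operators sit in the operator tree is one way to see which parts of a federated query are pushed down to the remote source.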

This concludes the highlights from the SAP HANA Cloud Workshops. I hope you enjoy this learning content and that we can offer new opportunities in the near future.

Please also check out the Extend Your SAP HANA On-Premise to SAP HANA Cloud, SAP HANA Database tutorial and more on SAP HANA Cloud.