The SAP Data Hub Pipeline engine provides by default more than 250 operators and more than 100 sample graphs. These operators and graphs are a very good tool set for most situations. However, there might be situations in which one needs a Pipeline operator with additional capabilities or additional sample graphs that can help to get started. This is where the new GitHub repository, the SAP Data Hub Integration Examples, can be of help. The repository was created with the goal to share sample code within the SAP community.
To start, the repository contains two custom operators and two sample graphs. Those examples were created based upon feedback from SAP Data Hub projects.
- The sample graph “HANA exposed via OpenApi” demonstrates how to expose HANA tables through a REST endpoint using the “OpenApi Servlow” operator.
- The Hive operator provides functionality to query a Hive Metastore server using HiveQL. It also supports Kerberos.
- The sample graph “Java Process Executor” demonstrates how to run a Java application in Pipeline graph using the “Process Executor” operator. Details on how this operator and the sample Java application can be created are described the blog Running a Java Application in a Data Pipeline.
- The ParquetWriter operator provides functionality to create a file in Parquet format from an input message, e.g. from the result of query of a HANA Client operator.
The intent is to add other operators and graphs in the future. The examples can be used free of charge on SAP Data Hub 2.3 or newer. Problems can be reported via GitHub issues.
Please download and try the examples. And, of course, please share your feedback with us!