Data Intelligence – Considerations for Cloud or On-premises/BYOL installation
A common question I’ve been addressing with customers lately on Data Intelligence is “which version should we choose – DI Cloud or on-premises?”
My goal is to provide some guidance on which deployment Data Intelligence is available as either a cloud subscription model or an on-premises/bring your own license (BYOL) installation. Through this post, I will outline the considerations in several areas:
- Deployment options
- Other influencing factors
The most common question is “what differences are there between the two options?” This question is probably rooted in the conversion of Data Hub to Data Intelligence earlier this year. When Data Intelligence was released, it included the ML Scenario manager and ML Data Manager, but these were not available in Data Hub. Data Intelligence BYOL and Data Intelligence Cloud have full feature parity. There is no difference between the deployment options!
There are similar concepts for Data Intelligence Cloud and On-premises for sizing and licensing. This is a much deeper topic but I suggest start by defining a set of use-cases and then review the sizing calculator options and then expand the recommendation by 15-20% to allow for growth. The sizing calculators have detailed definitions that help you narrow down sizing requirements.
In general you need to size on-premises systems for blocks of 64GB memory in the kubernetes environment. The sizing calculator can help you determine how many blocks are needed. The minimum configuration is 3 nodes in the Kubernetes cluster – see the help documentation for more details (https://help.sap.com/viewer/a8d90a56d61a49718ebcb5f65014bbe7/3.0.latest/en-US/7e2a9bf62ec94e9694648e2b5d2ce882.html)
For Data Intelligence Cloud, the minimum configuration is 4300 capacity units.
Here are direct links to the sizing calculators:
DI On-premises Sizing Calculator
DI Cloud Sizing Calculator
Data Intelligence on-premises installation has been simplified with the Software Lifecycle Bridge (SLCB) and Maintenance Planner. There are a lot of blogs (including one I wrote) on building an installation host or jumpbox for Data Hub. With Data Intelligence the installation host/jumpbox is no longer needed. There are other blogs that detail installation of Data Intelligence 3.0 such as this one from Dimitri Vorobiev.
For on-premises maintenance, patches are installed via the same SLCB and maintenance planner. It is a very easy process to update your installation (but be sure to follow the prerequisites on help.sap.com).
Data Intelligence Cloud is provisioned on your SCP account (it is available as a subscription model or under Cloud Platform Enterprise Agreement (CPEA). Simply provision it in your account and wait for it to be ready.
For the cloud instances. there are monthly scheduled maintenance windows where updates are applied.
Data Intelligence on-premises can be installed in any certified environment. This includes the hyperscalers and kubernetes on-premises. As of this blog post, the minimum kubernetes version should be at least 1.14.x (but that is a bit outdated now and I would recommend moving to at least 1.16.x) For the current list of supported bookmark this OSS Note: https://launchpad.support.sap.com/#/notes/2871970.
Data Intelligence Cloud is deployed only on AWS and Azure as of the time of this blog and not in every region.This may influence your decision to install it locally to your current region/availability zone. For the current list of options see here – be sure to filter for Data Intelligence in the search box below the map view: https://help.sap.com/doc/aa1ccd10da6c4337aa737df2ead1855b/Cloud/en-US/3b642f68227b4b1398d2ce1a5351389a.html
As of this blog these are the deployment options as of today (Oct 14, 2020) but more locations will be available in the future:
For Data Intelligence Cloud, we have the cloud connector option to reach into your on-premises environment securely.
Other Influencing Factors
Factors that should be considered for Cloud vs. On-premises include:
- The overall corporate strategy – if part of your digital transformation is moving to a cloud-based platform, it makes sense for Data Intelligence to be the backbone of your Business Transformation Platform in the cloud.
- Where is the bulk of the data? If you have a large ECC or S/4 HANA on premises where most of the data will be accessed, you want to consider the cloud ingress/egress costs and potential data movement. If you’re using HANA Cloud as part of the overall future strategy, you may want to deploy a cloud option as your future-state will be in the cloud.
- Deep learning Machine Learning with GPUs – Part of the Cloud deployment options is certified GPU nodes for kubernetes. If you want GPU’s on premises you have to refer to the certified GPU node types and include them into your kubernetes environment. https://launchpad.support.sap.com/#/notes/2900587
There are many factors in determining which option to select for your Data Intelligence application. The simplest is the cloud option as it does take all the kubernetes care and maintenance out of the equation as well as patching and upgrades.
Reach out or add a comment below if you have any other ideas on what should be considered!