Information Steward architecture for quick reference
AIM:- This document gives a quick, basic understanding of what SAP Information Steward is, where it fits in the SAP Data Management space, and how it relates to SAP Analytics. Understanding the architecture should help you assist colleagues in your organization in resolving technical issues.
SAP Data Management:- SAP offers various products, such as SAP Data Services, SAP Master Data Management, SAP Adaptive Server, and SAP Information Steward, that integrate disparate source systems. They collect/extract, transform, cleanse, and validate data, then load/distribute it to different target systems.
SAP Analytics:- SAP offers various products, such as SAP BusinessObjects Enterprise, SAP Crystal Reports, and SAP Dashboards, that allow business analysts to query, report on, and analyze the information loaded by SAP Data Management products, so that they can make forecasts and take decisions accordingly.
What is SAP Information Steward:- SAP Information Steward is part of the SAP Enterprise Information suite in the Data Management space. It allows data stewards and business analysts to discover, assess, define, monitor, and improve the quality of data through several modules:
- Data Insight: Profile data, create and run validation rules, monitor data quality through scorecards,
and create data cleansing solutions based on your data’s content-type identification results and SAP
best practices for your specific data.
- Metadata Management: Catalog the metadata across your system landscape, and analyze and understand
the relationships of your enterprise data.
- Metapedia: Define business terms for data and organize the terms into categories.
- Cleansing Package Builder: Define cleansing packages to parse and standardize data.
- Match Review: Review results of automated matching on a regular basis and make any necessary corrections.
Match Review maintains, in the “My Worklist” tab, a list of records that need reviewers’
actions for match decisions.
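To make the Data Insight terms above more concrete, here is a minimal conceptual sketch in plain Python of what "profiling" a column and scoring a "validation rule" mean. It does not use or represent any Information Steward API; the function names, the sample country-code rule, and the sample data are all assumptions made for illustration.

```python
from collections import Counter


def profile_column(values):
    """Return basic profile statistics for one column of values."""
    non_null = [v for v in values if v not in (None, "")]
    return {
        "count": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(set(non_null)),
        "most_common": Counter(non_null).most_common(1)[0][0] if non_null else None,
    }


def rule_score(values, rule):
    """Percentage of values that pass a validation rule (a predicate function)."""
    if not values:
        return 0.0
    passed = sum(1 for v in values if rule(v))
    return round(100.0 * passed / len(values), 1)


def is_valid_country_code(value):
    """Hypothetical rule: value must be a two-letter upper-case code."""
    return (
        isinstance(value, str)
        and len(value) == 2
        and value.isalpha()
        and value.isupper()
    )


# Hypothetical sample column with a few typical quality problems.
countries = ["US", "DE", "de", None, "FRA", "US"]

profile = profile_column(countries)
score = rule_score(countries, is_valid_country_code)

print(profile)
print(f"Rule score: {score}%")
```

A scorecard, in this simplified view, is such rule scores tracked over time for each data set.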
Prerequisites for the SAP Information Steward landscape:-
In the typical Information Steward landscape, you must first install one of the following products. These products provide platform services such as security, scalability, and high availability for Data Services and Information Steward.
- SAP BusinessObjects Information Platform Services (IPS) if you only want to use features of Information Steward
- SAP BusinessObjects Business Intelligence platform (BI platform) if you also want to use Business Intelligence clients such as Web Intelligence documents or Crystal Reports
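The choice between the two platforms can be written down as a small decision helper. This is only a sketch of the rule stated above; the function name and the idea of passing a list of planned tools are assumptions, and the set of BI clients contains only the two examples named in this document.

```python
# BI clients named in this document; a real landscape may include others.
BI_CLIENTS = {"Web Intelligence", "Crystal Reports"}

IPS = "SAP BusinessObjects Information Platform Services (IPS)"
BI_PLATFORM = "SAP BusinessObjects Business Intelligence platform (BI platform)"


def choose_platform(planned_tools):
    """Pick the prerequisite platform from the tools you plan to use."""
    if BI_CLIENTS.intersection(planned_tools):
        return BI_PLATFORM
    return IPS


print(choose_platform(["Information Steward"]))
print(choose_platform(["Information Steward", "Crystal Reports"]))
```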
The following diagram shows where Information Steward sits relative to SAP Data Services and the SAP BusinessObjects Business Intelligence platform.
SAP Information Steward architecture and relationship with SAP Data Services and SAP BusinessObjects Business Intelligence Platform:-
SAP Information Steward uses the SAP BusinessObjects Business Intelligence (BI) platform and SAP Data Services, and inherits the scalable architecture that these two platforms provide.
SAP Information Steward requires SAP BusinessObjects BI Platform for the following functionality:
- Manage user and group security
- Schedule and run on-demand services for Metadata Management (integrator sources and utilities)
- Schedule and run on-demand services for Data Insight (profiling and rule tasks and utilities)
- Perform administrative tasks for Information Steward with the Central Management Console (CMC)
- Achieve scalability through load balancing, and high availability
SAP Information Steward requires the following components of SAP Data Services:
- The Data Services Job Server installed on the primary computer.
- The Data Services Job Server provides the engine processes that perform the Data Insight data
profiling and validation rule tasks. The engine processes use parallel execution and in-memory
processing to deliver high data throughput and scalability.
- The Data Services Metadata Browsing Service provides the capabilities to browse and import the
metadata from Data Insight connections.
- The Data Services View Data Service provides the capabilities to view the source data from Data
Insight connections.
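The Data Services requirements listed above can be captured as a simple checklist. The sketch below is illustrative only: the dictionary restates the list from this section, while the function name and the idea of passing in a list of installed component names are assumptions, not part of any SAP tool.

```python
# Data Services components that Information Steward relies on,
# with the role each one plays (restated from the list above).
REQUIRED_DATA_SERVICES_COMPONENTS = {
    "Job Server": "runs Data Insight profiling and validation rule tasks",
    "Metadata Browsing Service": "browses and imports metadata from Data Insight connections",
    "View Data Service": "views source data from Data Insight connections",
}


def missing_components(installed):
    """Return the required components that are not in the installed list."""
    installed_set = set(installed)
    return [
        name
        for name in REQUIRED_DATA_SERVICES_COMPONENTS
        if name not in installed_set
    ]


# Hypothetical landscape where only the Job Server has been installed.
for name in missing_components(["Job Server"]):
    print(f"Missing: {name} ({REQUIRED_DATA_SERVICES_COMPONENTS[name]})")
```

When a Data Insight feature does not work, a checklist like this is a quick way to tell a colleague which Data Services component to look at first.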
The Data Services Job Server provides the following system management tools, which are required during the first installation of Information Steward:
Repository Manager:- The Repository Manager creates the required Data Insight objects in the Information Steward repository. The Information Steward installer invokes the Repository Manager automatically when it creates the repository the first time the installer is run.
Server Manager:- The Server Manager creates the Information Steward job server group and job servers and associates them with the Information Steward repository. To add job servers to the Information Steward job server group, you must manually invoke the Server Manager.
SAP BusinessObjects BI Platform components and their usage for Information Steward:-
The following table describes how SAP Data Services and SAP Information Steward use each pertinent
SAP BusinessObjects Business Intelligence (BI) platform component, or SAP BusinessObjects Information
Platform Services (IPS) component if you are not using Business Intelligence clients such as Web
Intelligence documents or Crystal Reports.
BI platform or | Usage for Data Services | Usage for Information Steward |
Web Tier | Deploys Data Services on the Central | Deploys Information Steward on: |
Central Management | Used by IT administrator to manage: | Used by IT administrator to manage: |
Central Management | Maintains a database of information | Maintains a database of information |
Platform | The Platform Scheduling Services is not | Information Steward Job Server uses the |
Platform Processing | Required during Data Services installation | Required during Information Steward |
File Input | Stores input and output files associated | Stores files associated with: |
Information Steward Components and their usages:-
The following table shows the Information Steward components that you can choose on the “Select Features” window of the installer:
Component | Description |
Information Steward Web Application | Provides web applications that: |
Information Steward Task Server | Processes profile and rule tasks for the Data Insight module of Information |
Application Service | • Provides the Information Steward application the ability to access |
Information Steward Data Review | Processes match review tasks for the Data Review module of Information |
Metadata Search Service | Provides search capability on the Metadata Management module of Information |
Information Steward Cleansing | Provides the capability to create and refine cleansing packages using |
Information Steward Data Review | Checks if the input table contains new match groups and whether the |
SAP BusinessObjects Enterprise | Collects information from an SAP BusinessObjects Business Intelligence |
SAP NetWeaver Business Warehouse | Collects information from a NetWeaver Business Warehouse system |
Common Warehouse Model | Collects information from the CWM Relational Package that includes |
Relational Databases (RDBMS) | Collects information from an RDBMS that includes definitions of metadata |
SAP HANA Metadata Integrator | Collects information from an SAP HANA database that includes definitions |
SAP Data Services Metadata Integrator | Collects information from an SAP Data Services repository that includes |
SAP Data Federator Metadata | Collects information from an SAP Data Federator repository that includes |
Meta Integration Metadata Bridge | Collects the following metadata from other sources: |
For the MIMB's, are the bridges for ER/Studio Data Architect and ER/Studio Repository up to date? Last I checked they were only up to date to connect to ER/Studio's Data Architect 9.6 and Repository 6.6. The repository for ER/Studio is now at 7.0.1 and Data Architect is now at 16.0.
Hi Josh,
You should open an SAP Support Incident regarding this, as I know it is specific to the exact version of Information Steward you are using. We do have someone in support who can assist you.
Thanks,
Julie