Former Member

Information Steward architecture for quick reference

AIM:- The purpose of this document is to give a quick, basic understanding of what Information Steward is, where it fits in the SAP Data Management space, and how it relates to SAP Analytics, so that you can help colleagues in your organization resolve technical issues by understanding the architecture.

SAP Data Management:- SAP offers various products, such as SAP Data Services, SAP Master Data Management, SAP Adaptive Server, and SAP Information Steward, that integrate disparate source systems, collect or extract data, transform, cleanse, and validate it, and load or distribute it to different target systems.

SAP Analytics:- SAP offers various products, such as SAP BusinessObjects Enterprise, SAP Crystal Reports, and SAP Dashboards, that allow business analysts to query, report on, and analyze the information loaded by the SAP Data Management products, so that they can make forecasts and take decisions accordingly.

What is SAP Information Steward:- SAP Information Steward is part of the SAP solutions for Enterprise Information Management (EIM) in the Data Management space. It allows data stewards and business analysts to discover, assess, define, monitor, and improve the quality of their data through several modules:-

  • Data Insight: Profile data, create and run validation rules, monitor data quality through scorecards, and create data cleansing solutions based on your data’s content-type identification results and SAP best practices for your specific data.
  • Metadata Management: Catalog the metadata across your system landscape, and analyze and understand the relationships of your enterprise data.
  • Metapedia: Define business terms for data and organize the terms into categories.
  • Cleansing Package Builder: Define cleansing packages to parse and standardize data.
  • Match Review: Review results of automated matching on a regular basis and make any necessary corrections. Match Review maintains a list of records in the “My Worklist” tab that require reviewers’ actions for match decisions.

Pre-requisites for SAP Information Steward landscape:-

In the typical Information Steward landscape, you must first install one of the following products. These products provide platform services such as security, scalability, and high availability for Data Services and Information Steward.

  • SAP BusinessObjects Information Platform Services (IPS) if you only want to use features of Information Steward
  • SAP BusinessObjects Business Intelligence platform (BI platform) if you also want to use Business Intelligence clients such as Web Intelligence documents or Crystal Reports

The following diagram shows where Information Steward sits in relation to SAP Data Services and the SAP BusinessObjects Business Intelligence platform.

[Figure: is arch.jpg (Information Steward architecture diagram)]

SAP Information Steward architecture and relationship with SAP Data Services and SAP BusinessObjects Business Intelligence Platform:

SAP Information Steward uses the SAP BusinessObjects Business Intelligence (BI) platform and SAP Data Services, and inherits the scalable architecture that these two platforms provide.

SAP Information Steward requires SAP BusinessObjects BI Platform for the following functionality:

  • Manage user and group security
  • Schedule and run on-demand services for Metadata Management (integrator sources and utilities)
  • Schedule and run on-demand services for Data Insight (profiling and rule tasks and utilities)
  • Perform administrative tasks for Information Steward with the Central Management Console (CMC)
  • Provide scalability through load balancing and high availability

SAP Information Steward requires the following components of SAP Data Services:

  • The Data Services Job Server, installed on the primary computer. It provides the engine processes that perform the Data Insight data profiling and validation rule tasks. The engine processes use parallel execution and in-memory processing to deliver high data throughput and scalability.
  • The Data Services Metadata Browsing Service provides the capabilities to browse and import the metadata from Data Insight connections.
  • The Data Services View Data Service provides the capabilities to view the source data from Data Insight connections.
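When a Data Insight operation fails, the list above tells you which Data Services component to check first. A minimal sketch of that lookup follows; the operation labels and the `component_for` helper are hypothetical names chosen for illustration, not SAP identifiers.

```python
# Hypothetical lookup: Data Insight operation -> Data Services component.
# The keys are illustrative labels; the values are the component names
# from the list above.
DATA_INSIGHT_DEPENDENCIES = {
    "profiling task": "Data Services Job Server",
    "validation rule task": "Data Services Job Server",
    "browse or import metadata": "Data Services Metadata Browsing Service",
    "view source data": "Data Services View Data Service",
}


def component_for(operation: str) -> str:
    """Return the Data Services component behind a Data Insight operation."""
    try:
        return DATA_INSIGHT_DEPENDENCIES[operation]
    except KeyError:
        raise ValueError(f"unknown Data Insight operation: {operation!r}") from None


print(component_for("view source data"))  # Data Services View Data Service
```

For example, if users cannot preview source rows from a Data Insight connection, this points to the View Data Service rather than the Job Server.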

The Data Services Job Server provides the following system management tools that are required during the first installation of Information Steward:

Repository Manager:- The Repository Manager creates the required Data Insight objects in the Information Steward repository. The Information Steward installer invokes the Repository Manager automatically when it creates the repository the first time the installer is run.

Server Manager:- The Server Manager creates the Information Steward job server group and job servers and associates them with the Information Steward repository. To add job servers to the Information Steward job server group, you must manually invoke the Server Manager.

SAP Business Objects BI Platform Components and usages for Information Steward:-

The following table describes how SAP Data Services and SAP Information Steward use each pertinent SAP BusinessObjects Business Intelligence (BI) platform component, or SAP BusinessObjects Information Platform Services (IPS) component if you are not using Business Intelligence clients such as Web Intelligence documents or Crystal Reports.

BI platform or IPS component

Usage for Data Services

Usage for Information Steward

Web Tier

Usage for Data Services:
Deploys Data Services on the Central Management Console (CMC), through which administrative tasks for Data Services are performed.

Usage for Information Steward:
Deploys Information Steward on:
  • The CMC, through which administrative tasks for Information Steward are performed.
  • A web application server, through which you access the Information Steward modules.

Central Management Console (CMC)

Usage for Data Services:
Used by IT administrators to manage:
  • SAP solutions for Enterprise Information Management (EIM) Adaptive Processing Server and services
  • User security (authentication and authorization)
  • Repository and application settings

Usage for Information Steward:
Used by IT administrators to manage:
  • EIM Adaptive Processing Server and services
  • Information Steward Job Server and services
  • Metadata Management module
  • Data Insight module
  • Cleansing Package Builder module
  • Data Review module connections and tasks
  • Information Steward utilities
  • User security (authentication and authorization)
  • Repository and application settings

Central Management Server (CMS)

Usage for Data Services:
Maintains a database of information about your SAP BusinessObjects BI platform system. The data stored by the CMS includes information about users and groups, security levels, schedule information, BI platform content, and servers. For more information about the CMS, see the SAP BusinessObjects Business Intelligence Platform Administrator’s Guide. If you installed IPS, see the SAP BusinessObjects Information platform services Administrator’s Guide.
Data Services relies on the CMS for:
  • Centralized user and group management
  • Flexible authentication methods
  • Password enforcement policies

Usage for Information Steward:
Maintains a database of information about your BI platform system. The data stored by the CMS includes information about users and groups, security levels, schedule information, BI platform content, and servers.
The following objects in the Metadata Management module of Information Steward are stored in the CMS:
  • Integrator Source configurations
  • Source groups
  • Utilities configurations
  • Data Insight connections
  • Projects
  • Tasks
Note: Because integrator source configurations and source group definitions are stored in the CMS, you can use the Upgrade management tool to move them from one version of the CMS to another. The schedules and rights information are considered dependencies of these configurations. For details, see the SAP BusinessObjects Information Steward Upgrade Guide.

Platform Scheduling Services

Usage for Data Services:
The Platform Scheduling Services are not required to run Data Services. If you will not install Information Steward, you can stop the Platform Scheduling Services.

Usage for Information Steward:
The Information Steward Job Server uses the Platform Scheduling Services for executing profiling tasks and integrator tasks. The server may host the following services for Information Steward:
  • Task Scheduling Service
  • Integrator Scheduling Service
  • Data Review Scheduling Service

Platform Processing Services

Usage for Data Services:
Required during Data Services installation to create the Enterprise Information Management (EIM) Adaptive Processing Server.
The EIM Adaptive Processing Server uses the Platform Processing Services to host the following services:
  • RFC Server Service, which Data Services requires for connectivity with SAP NetWeaver Business Warehouse
  • Job Launcher Service, which Data Services uses to send batch jobs to the appropriate Data Services Job Server for execution
  • Data Services Metadata Browsing Service, which is used by other applications such as Information Steward to browse and import metadata from Data Insight connections
  • Data Services View Data Service, which is used by other applications such as Information Steward to view data in Data Insight connections
The Data Services Workbench requires the EIM Adaptive Processing Server for operations such as testing connections and all communication with the repository to deploy objects, run jobs, and so forth.
The Platform Processing Services can be stopped after installation, because only the EIM Adaptive Processing Server is required for Data Services to run.

Usage for Information Steward:
Required during Information Steward installation to create the EIM Adaptive Processing Server.
The EIM Adaptive Processing Server uses the Platform Processing Services to host the following services:
  • Metadata Search Service
  • Metadata Integrator Service
  • Data Services Metadata Browsing Service
  • Data Services View Data Service
  • Information Steward Administrator Task Service
  • Cleansing Package Builder Core Service
  • Cleansing Package Builder Autoanalysis Service
  • Cleansing Package Builder Publishing Service
  • Data Review Processing Service
  • Data Cleansing Advisor Service
  • Application Service

File Input Repository Server and File Output Repository Server

Usage for Data Services:
Stores input and output files associated with:
  • A published cleansing package. The stored information can be accessed by Data Services.

Usage for Information Steward:
Stores files associated with:
  • A published cleansing package. The stored information can be accessed by Data Services.
  • History logs for Data Insight task and metadata integrator execution
  • Search index for metadata integrator objects
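The Platform Processing Services entry in the table lists the services that the EIM Adaptive Processing Server hosts for each product. Comparing the two lists as sets makes it easy to see which services both products depend on. This is only a sketch: the variable names are hypothetical, and the service names are copied from the table as plain strings.

```python
# Services hosted on the EIM Adaptive Processing Server, as listed in the
# table above. Variable names are hypothetical; strings come from the table.
DATA_SERVICES_APS = {
    "RFC Server Service",
    "Job Launcher Service",
    "Data Services Metadata Browsing Service",
    "Data Services View Data Service",
}
INFORMATION_STEWARD_APS = {
    "Metadata Search Service",
    "Metadata Integrator Service",
    "Data Services Metadata Browsing Service",
    "Data Services View Data Service",
    "Information Steward Administrator Task Service",
    "Cleansing Package Builder Core Service",
    "Cleansing Package Builder Autoanalysis Service",
    "Cleansing Package Builder Publishing Service",
    "Data Review Processing Service",
    "Data Cleansing Advisor Service",
    "Application Service",
}

# Services that appear in both lists.
shared = DATA_SERVICES_APS & INFORMATION_STEWARD_APS
# Services listed only for Data Services.
data_services_only = DATA_SERVICES_APS - INFORMATION_STEWARD_APS

print(sorted(shared))
print(sorted(data_services_only))
```

The two shared services are the ones Information Steward uses to browse metadata and view data through Data Insight connections, so a problem with either can show up in both products.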

Information Steward Components and their usages:-

The following table shows the Information Steward components that you can select in the “Select Features” window of the installer:

Component

Description

Information Steward Web Application
Provides web applications that:
  • Administer Information Steward in the Central Management Console (CMC)
  • Display the Information Steward user interface
Note: If the BI Platform Web applications were manually deployed, manually deploy the Information Steward Web application with WDeploy after installation.

Information Steward Task Server
Processes profile and rule tasks for the Data Insight module of Information Steward.

Application Service
  • Provides the Information Steward application the ability to access (read and write) the Information Steward repository.
  • Processes object relationships (for example, impact and lineage).

Information Steward Data Review Server
Processes match review tasks for the Data Review module of Information Steward.

Metadata Search Service
Provides search capability on the Metadata Management module of Information Steward.

Information Steward Cleansing Package Builder Service
Provides the capability to create and refine cleansing packages using the Cleansing Package Builder module of Information Steward.

Information Steward Data Review Service
Checks if the input table contains new match groups and whether the match results are ready for review, and creates match review tasks for the Data Review module of Information Steward.

SAP BusinessObjects Enterprise Metadata Integrator
Collects information from an SAP BusinessObjects Business Intelligence platform repository that includes metadata objects such as SAP Crystal Reports, Web Intelligence documents, and universes.

SAP NetWeaver Business Warehouse Metadata Integrator
Collects information from a NetWeaver Business Warehouse system that includes metadata objects such as Queries, InfoProviders, InfoObjects, Transformations, and DataSources.

Common Warehouse Model (CWM) Metadata Integrator
Collects information from the CWM Relational Package that includes definitions of metadata objects such as catalogs, schemas, and tables.

Relational Databases (RDBMS) Metadata Integrator
Collects information from an RDBMS that includes definitions of metadata objects such as catalogs, schemas, stored procedures, and aliases. Supported relational databases include DB2, MySQL, Oracle, SQL Server, Teradata, and a Universe connection using JDBC or ODBC. For more information, see the Product Availability Matrix.

SAP HANA Metadata Integrator
Collects information from an SAP HANA database that includes definitions of metadata objects such as databases, schemas, tables, and views.

SAP Data Services Metadata Integrator
Collects information from an SAP Data Services repository that includes definitions of metadata objects such as source tables and columns for ETL jobs, datastores and configurations, and flat files.

SAP Data Federator Metadata Integrator
Collects information from an SAP Data Federator repository that includes definitions of metadata objects such as projects, catalogs, datasources, and mapping rules.

Meta Integration Metadata Bridge (MIMB) Metadata Integrator
Collects the following metadata from other sources:
  • Data Modeling metadata such as Sybase PowerDesigner, Embarcadero ER/Studio, and Oracle Designer
  • ETL metadata such as Oracle Warehouse Builder and Microsoft SQL Server Integration Services (SSIS)
  • OLAP and BI metadata such as IBM DB2 Cube Views, Oracle OLAP, and Cognos 8 BI Reporting
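To find which metadata integrator is relevant for a given kind of object, the descriptions above can be turned into a small searchable structure. The sketch below covers only a subset of the integrators. The object types are the examples given in the table (which lists them with "such as"), so the sets are not exhaustive, and `integrators_collecting` is a hypothetical helper name.

```python
# Example metadata object types per integrator, copied from the table above.
# The table introduces them with "such as", so these sets are not exhaustive.
INTEGRATOR_OBJECTS = {
    "SAP BusinessObjects Enterprise": {
        "SAP Crystal Reports", "Web Intelligence documents", "universes",
    },
    "SAP NetWeaver Business Warehouse": {
        "Queries", "InfoProviders", "InfoObjects", "Transformations", "DataSources",
    },
    "Common Warehouse Model (CWM)": {"catalogs", "schemas", "tables"},
    "Relational Databases (RDBMS)": {
        "catalogs", "schemas", "stored procedures", "aliases",
    },
    "SAP HANA": {"databases", "schemas", "tables", "views"},
    "SAP Data Federator": {"projects", "catalogs", "datasources", "mapping rules"},
}


def integrators_collecting(object_type: str) -> list[str]:
    """Return, sorted by name, the integrators whose examples include object_type."""
    return sorted(
        name for name, objects in INTEGRATOR_OBJECTS.items() if object_type in objects
    )


print(integrators_collecting("schemas"))
```

With the data above, searching for "schemas" returns the CWM, RDBMS, and SAP HANA integrators.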

All of the above information can be used as a quick reference whenever there is an issue with a particular module, so that the issue can be resolved without digging into the larger guides.
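As one way to use this material as a quick reference, the component table can be condensed into a lookup from module to the installer components that do its processing. This is a sketch only: `MODULE_COMPONENTS` and `components_to_check` are hypothetical names, and the mapping simply restates the component descriptions in this document.

```python
# Hypothetical quick-reference lookup: Information Steward module -> the
# installer components whose descriptions in this document mention it.
MODULE_COMPONENTS = {
    "Data Insight": ["Information Steward Task Server"],
    "Data Review": [
        "Information Steward Data Review Server",
        "Information Steward Data Review Service",
    ],
    "Metadata Management": ["Metadata Search Service"],
    "Cleansing Package Builder": [
        "Information Steward Cleansing Package Builder Service",
    ],
}


def components_to_check(module: str) -> list[str]:
    """Return the components to look at first when a module misbehaves."""
    return MODULE_COMPONENTS.get(module, [])


for component in components_to_check("Data Review"):
    print(component)
```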

      2 Comments
      Former Member

      For the MIMB's, are the bridges for ER/Studio Data Architect and ER/Studio Repository up to date? Last I checked they were only up to date to connect to ER/Studio's Data Architect 9.6 and Repository 6.6. The repository for ER/Studio is now at 7.0.1 and Data Architect is now at 16.0.

      Julie Oliver

      Hi Josh,

      You should open a SAP Support Incident regarding this as I know it is specific to the exact version of Information Steward you are using. We do have someone in support who can assist you.

      Thanks,
      Julie