Custom Data Quality Reporting with the Failed Data Repository
The Failed Data Repository
To build custom reports on the results of your data quality assessment, beyond what the Information Steward Data Quality Dashboards offer, you can use the Failed Data Repository as the reporting database.
The Failed Data Repository stores information about the data that failed your validation rules in a supported relational database system. This information includes:
- Information about the project, connection and tables which generated the failed data (IS_FD_TABLES_MAP table)
- Execution history of all the tasks which generated the failed data (IS_FD_RUN_HISTORY table)
- All failed rules for a given run (IS_FD_RULE_INFO table)
- All the rows that failed one or more rules (<table_alias>_FD table)
- All the rules which failed a given row (<table_alias>_FR table)
For detailed information about the above referenced tables, see the section on “Accessing additional failed data information” in the Information Steward User Guide. The diagram below shows the relationships between the failed data tables.
As an example, the total number of rows validated during a run is available for each Information Steward task in the TOTAL_ROWS column of the IS_FD_RUN_HISTORY table. You can join the *_FD tables to get the failed data counts per rule or task.
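The join described above can be sketched as follows. This is a minimal illustration against an in-memory SQLite database, not the actual repository schema: only the IS_FD_RUN_HISTORY table, its TOTAL_ROWS column, and the _FD/_FR table suffixes come from this article. The CUSTOMER table alias, the RUN_ID, ROW_ID and RULE_ID columns, and the sample rows are assumptions made for the example, so check the real column names in the Information Steward User Guide before adapting the queries.

```python
import sqlite3

# In-memory stand-in for a Failed Data Repository.
# From the article: IS_FD_RUN_HISTORY, TOTAL_ROWS, the _FD and _FR suffixes.
# Hypothetical: the CUSTOMER alias, RUN_ID, ROW_ID, RULE_ID, and all sample data.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE IS_FD_RUN_HISTORY (RUN_ID INTEGER PRIMARY KEY, TOTAL_ROWS INTEGER);
    CREATE TABLE CUSTOMER_FD (RUN_ID INTEGER, ROW_ID INTEGER);
    CREATE TABLE CUSTOMER_FR (RUN_ID INTEGER, ROW_ID INTEGER, RULE_ID TEXT);

    INSERT INTO IS_FD_RUN_HISTORY VALUES (1, 1000), (2, 1200);
    INSERT INTO CUSTOMER_FD VALUES (1, 10), (1, 11), (1, 12), (2, 10);
    INSERT INTO CUSTOMER_FR VALUES
        (1, 10, 'R_ZIP'), (1, 11, 'R_ZIP'), (1, 11, 'R_EMAIL'),
        (1, 12, 'R_EMAIL'), (2, 10, 'R_ZIP');
""")


def failed_rows_per_run(conn):
    """Return (run_id, total_rows, failed_rows, pct_failed) for each run."""
    return conn.execute("""
        SELECT h.RUN_ID,
               h.TOTAL_ROWS,
               COUNT(fd.ROW_ID) AS FAILED_ROWS,
               ROUND(100.0 * COUNT(fd.ROW_ID) / h.TOTAL_ROWS, 2) AS PCT_FAILED
        FROM IS_FD_RUN_HISTORY h
        LEFT JOIN CUSTOMER_FD fd ON fd.RUN_ID = h.RUN_ID
        GROUP BY h.RUN_ID, h.TOTAL_ROWS
        ORDER BY h.RUN_ID
    """).fetchall()


def failed_rows_per_rule(conn):
    """Return (run_id, rule_id, failed_rows): rows that failed each rule per run."""
    return conn.execute("""
        SELECT RUN_ID, RULE_ID, COUNT(*) AS FAILED_ROWS
        FROM CUSTOMER_FR
        GROUP BY RUN_ID, RULE_ID
        ORDER BY RUN_ID, RULE_ID
    """).fetchall()


if __name__ == "__main__":
    for row in failed_rows_per_run(conn):
        print(row)
    for row in failed_rows_per_rule(conn):
        print(row)
```

The LEFT JOIN keeps runs that produced no failed rows, so they appear in the report with a failed count of zero instead of dropping out. Against a real repository you would replace the SQLite connection with a driver for your database and substitute the actual table alias and key columns.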
Setting Up the Failed Data Repository
To use a Failed Data Repository for custom reporting, you must first establish a connection to the database within the Central Management Console (CMC). Ensure that the connection type is “For data that failed rules.” The image below shows an example of the connection parameters, many of which change depending on the Database Type selected. For the current list of supported databases, see the SAP Information Steward Product Availability Matrix (PAM).
Specifying the Failed Data Repository
When executing a rule or set of rules (per task), you can choose to save the failed data to one of the Failed Data Repositories that you have previously configured.
Viewing Failed Data from Information Steward
This is the Data Quality Scorecard Detailed View. From here you can view the failed data:
This opens the Failed Data screen. Once you have set up the Failed Data Repository, the screen gives you access to “View More Failed Data” so that you can get beyond the 500-record sample size.
The Information Steward Repository Views
Although the Information Steward Repository is not a supported means of extracting data for custom reporting, here are a few Information Steward Repository Views that may contain additional information to meet your needs:
- Key Data Domain, Quality Dimension, or Rule level
- Quality Dimension descriptions
- Rule definition/description
- Project Names
- How to use SAP BusinessObjects Information Steward, along with the SAP Business Intelligence platform components SAP BusinessObjects Information Design Tool and SAP BusinessObjects BI launch pad, to produce a Web Intelligence report that will help you analyze the quality of your data.