How to Build Cleansing Packages in SAP Information Steward

  • by Virendra K. Soni, SAP Data Services Consultant, SAP
  • Sunil Mehta, Solution Architect, SAP
  • Biswajit Biswas, Subject Matter Expert in SAP Analytics, SAP GD
  • Deepinder Singh, SAP Analytics Solution Architect
  • June 26, 2014
SAP Information Steward is an innovative and collaborative solution for more effective data governance. It combines metadata management, data profiling, and data cleansing into one solution. It also provides a single platform for both data stewards and business analysts to govern their data assets. This step-by-step guide demonstrates how to build a new custom cleansing package using SAP Information Steward’s Cleansing Package Builder module.
Learning Objectives

Reading this article you will learn how to:

  • Build custom cleansing packages using the Cleansing Package Builder feature of Information Steward
  • Publish a custom cleansing package
Key Concept

In the Cleansing Package Builder module of SAP Information Steward, the data steward (a person who understands the business data as well as the data management concept) analyzes the input or sample data and defines the standard forms and variations based on this information. Cleansing Package Builder automatically creates parsing rules based on how the input data is classified with standard forms and variations. These cleansing packages are published to SAP Data Services and then used in the Base Data Cleansing Transform to build a Data Cleansing batch job.

In our example scenario, a well-known retail company wants to cleanse its inventory data. The current inventory data is not yielding the correct results in the company’s inventory report. The company uses SAP Data Services for validation and cleansing. SAP Data Services has its own standard cleansing packages, but these standard packages are not sufficient in this case. Therefore, the company needs to build custom cleansing packages in SAP Information Steward that can be consumed in SAP Data Services for cleansing its inventory data.

Following are step-by-step details for how we solved this issue using SAP Information Steward.

Virendra K. Soni

Virendra K. Soni is a Certified SAP Data Services Consultant and is currently associated with SAP. He has 6.5 years of industry experience in Data Migration, Data Conversion, and Data ware Housing with the Retail and Self-insurance industries. Prior to SAP he has worked with Capgemini, CSC and HCL Technologies.

See more by this author

Sunil Mehta

Sunil Mehta is a solution architect at SAP. He received his master’s degree in Computer Management from Symbiosis in Pune, India. He is a certified SAP FI/CO/BOBJ consultant, working in analytics. During his career he has been associated with Accenture, IBM, Capgemini, and KPMG, and has worked in various roles, including as a consultant solution architect and a project manager.

See more by this author

Biswajit Biswas

Biswajit Biswas works at SAP GD and is a subject matter expert in SAP analytics. He has five years’ experience. He is proficient in the SAP BusinessObjects suite of reporting tools and SAP Data Services. He has been associated with development of Rapid Deployment Solutions for analytics on SAP HANA, focusing on the utilities industry.

See more by this author

Deepinder Singh

Deepinder Singh is an SAP analytics solution architect with a focus on expert consulting. He has worked with CMC Limited and Accenture, and has catered to clients across multiple industries such as utilities, mining, real estate, and chemicals. Throughout his career he has assisted clients in planning their technology roadmaps and execution as planned in the capacity of program or project manager.

See more by this author


Comments

No comments have been submitted on this article. 


Please log in to post a comment.

To learn more about subscription access to premium content, click here.