Hi
In this blog I am going to share my experience while exploring and working on one of the Customer's data cleansing and reporting project. This Project involves multiple SAP Technologies like :SAP BODS ,SAP Information Steward,SAP Business Objects(BO) , SQL Server(database,client's requirement,Non SAP),
Such kind of projects have lots of data to be cleansed , put in other words, required to be in the desired format as the client needs.The Client will be sharing their requirements beforehand. Here we used SAP Information Steward to create various rules(involving Regex functions) , which are used in SAP BODS to cleanse and transform the data. Using SAP BODS we create various types of transforms which helps in cleansing the data process very smoother and easier. The data might be cleansed in various ways:Automated or Manual.However majority of the tasks are being carried by the automated cleansing process through the tools(Information steward and BODS).
SQL Server is the back-end database, being used at the client side to maintain their data.Hence the created tables in SAP BODS and SAP Information Steward are being stored on it.
The volume of data to be cleansed may vary across projects. In our case, we divided the data load and cleanse processes into 4 batch job load process (comprising 250 K data in each batch).
The project involved reporting as well. The customers wanted to display the amount of records cleansed,the changes which were affected to specific fields which were requested, count of records before cleanse and after cleanse and many more things, for which a universe fetching data from SQL server( which holds the cleansed data during each batch load) was created, and then using the Webi (Web-Intelligence) the reports were created.
Very soon ,will be adding few images describing the process flow.
Till Then Happy Learning
Thank You