Scrubbing data involves making the data consistent. It is possible that the same data is represented in different ways in multiple operational systems. For example, a product name might be abbreviated in one operational system, but not in another. If the two values were not made consistent, any queries using the data likely would evaluate the values as different products. If the detail data in the data warehouse is to produce consistent information, the product name must be made consistent for all values.
Data scrubbing can be achieved:
For example, the data in the order entry system should be scrubbed, such as the values for:
Using the DTS Import and DTS Export wizards, an ActiveX script can be executed during the copy process, that checks the State value, and changes it to a known two-character value. Alternatively, the ProductName value could be scrubbed by writing a Visual C++ program that calls the DTS API to execute Microsoft JScript® scripts , and other executable modules.
Column Mappings | DTS Lookup |
Data Transformation Services Import and Export Wizards | Programming DTS Applications |