In the business world, incorrect data can be costly. Many companies use customer information databases that record data such as contact information, addresses, and preferences. If the addresses are inconsistent, for instance, the company bears the cost of resending mail and may even lose customers. To count as high-quality, data needs to pass a set of quality criteria.
The same applies in the public sector. For instance, the government may want to analyze population census figures to decide which regions require further spending and investment in infrastructure and services. In this case, access to reliable data is important to avoid erroneous fiscal decisions.
Data cleansing, or data cleaning, is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database. It involves identifying incomplete, incorrect, inaccurate, or irrelevant parts of the data and then replacing, modifying, or deleting the dirty or coarse data. It should not be confused with sanitization of classified information or with data scrubbing. Data cleansing may be performed interactively with data wrangling tools, or as batch processing through scripting.
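As a rough illustration of the batch-scripting approach, here is a minimal Python sketch that cleans a small list of customer records. Everything in it is an assumption made for the example: the record fields (`name`, `address`), the `ABBREVIATIONS` map, and the helper names `normalize_address` and `clean_records` are hypothetical and do not come from any particular tool. It rejects incomplete records, normalizes inconsistent addresses, and removes duplicates.

```python
import re

# Hypothetical abbreviation map for this example; real data needs a fuller list.
ABBREVIATIONS = {
    "st": "street", "st.": "street",
    "ave": "avenue", "ave.": "avenue",
    "rd": "road", "rd.": "road",
}


def normalize_address(raw):
    """Lowercase, collapse repeated whitespace, and expand known abbreviations."""
    words = re.sub(r"\s+", " ", raw.strip().lower()).split(" ")
    return " ".join(ABBREVIATIONS.get(word, word) for word in words)


def clean_records(records):
    """Split records into (cleaned, rejected) lists.

    Incomplete records are rejected, addresses are normalized,
    and duplicates (same name and normalized address) are dropped.
    """
    cleaned, rejected, seen = [], [], set()
    for record in records:
        name = (record.get("name") or "").strip()
        address = (record.get("address") or "").strip()
        if not name or not address:
            rejected.append(record)  # incomplete: set aside for review
            continue
        fixed = {"name": name, "address": normalize_address(address)}
        key = (fixed["name"].lower(), fixed["address"])
        if key in seen:
            continue  # duplicate of a record already kept
        seen.add(key)
        cleaned.append(fixed)
    return cleaned, rejected


records = [
    {"name": "Ada Lovelace", "address": "12  Main St."},
    {"name": "ada lovelace", "address": "12 main street"},
    {"name": "", "address": "5 Oak Ave"},
]
cleaned, rejected = clean_records(records)
print(cleaned)
print(rejected)
```

Keeping rejected records in a separate list, rather than silently discarding them, leaves room for manual review, which reflects the "correcting or removing" choice described above.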