Data transformation is a set of techniques used to convert data from one format or structure to another format or structure. The following are some examples of transformation activities:
- Data deduplication involves the identification of duplicates and their removal.
- Key restructuring involves transforming any keys with built-in meanings to the generic keys.
- Data cleansing involves extracting words and deleting out-of-date, inaccurate, and incomplete information from the source language without extracting the meaning or information to enhance the accuracy of the source data.
- Data validation is a process of formulating rules or algorithms that help in validating different types of data against some known issues.
- Format revisioning involves converting from one format to another.
- Data derivation consists of creating a set of rules to generate more information from the...