Collecting and managing metadata
In the previous section, we looked at how data can be cataloged using Microsoft Purview. The built-in Microsoft Purview scanners scan and ingest basic technical metadata from data sources. This includes file types, column names, column types, and basic out-of-the-box classifications. However, this initial technical metadata is extracted from the data source purely based on the definitions available in the data source itself. Some data sources, such as Microsoft SQL Server, maintain significant amounts of data relating to the schema and its relationships. But others, such as CSV files stored in blob storage, do not have any information other than a column header. Hence, after the initial scan and ingest cycle, the governance team needs to get to work editing and enhancing the metadata to make the data assets more meaningful.
The real advantage of cataloging data and making it searchable is to make data more meaningful to the users. Users searching...