Data Lineage
Data Lineage is the capability that tracks and documents the journey data takes through an organization’s systems. Data lineage shows how data flows through the systems, how it is changed or manipulated along the journey, and where it is stored and ultimately used. In short, data lineage provides transparency into the data. This is critical because transparency helps us drive trust in our data. When we know what the data is and where it comes from, we trust it more, and it is a key component in understanding the data and believing what it tells us.
The other key component of a data lineage capability is it tracks the flow of data over time. Unlike some of the other data capabilities in this book, data lineage provides information about the data life cycle. By providing a clearer understanding of where data originated, how it may have been changed, and where it ultimately resides, we are able to see all the transformations the data went through along its life...