Why is data quality important?
In the previous chapter, we discussed how the data is distributed across the data mesh inside each data product. The responsibility of maintaining the data quality and availability is with the data owners. We also discussed how data products could be built using data from other data products. This could create a chain of dependency between different products. Failure of one data product to provide data in a timely manner could have a domino effect on the rest of the data product chain.
We drew a parallel between API management and data contracts. It’s important that data schemas are versioned and provide all the relevant information to its consumers so that they can make an informed decision about using the data. This is just like how APIs are managed and maintained.
Data contracts ensure that data is available when it says it’s going to be available and is of the same schema version written in the contract. But what about the data...