A data lake costs money to build and manage, so the parties involved expect a lot from it, and their expectations vary widely. Let's divide these expectations into two groups, based on the parties involved.
Expectations of business users:
- Analysis always runs on the right data, with good quality attributes.
- Capability to easily manage data governance.
- Security measures that allow data visibility to be controlled in a fine-grained fashion, with easy data masking when needed, applied through appropriate transformations controlled by authorization mechanisms.
- Self-service capability that requires minimal technical knowledge, for a broad spectrum of people.
- Easier representation of data lineage and traceability.
- Support for metadata management.
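To make the masking expectation concrete, here is a minimal sketch of a transformation controlled by an authorization rule. The roles, column names, and the `POLICY` table are hypothetical and chosen only for illustration; a real data lake would usually delegate this to its governance or access-control layer.

```python
# Hypothetical policy: which columns each role may see unmasked.
POLICY = {
    "analyst": {"customer_id", "country"},
    "auditor": {"customer_id", "country", "email"},
}

MASK = "****"


def mask_record(record: dict, role: str) -> dict:
    """Return a copy of the record with unauthorized columns masked."""
    # Unknown roles get an empty set, so every column is masked.
    visible = POLICY.get(role, set())
    return {
        column: (value if column in visible else MASK)
        for column, value in record.items()
    }


row = {"customer_id": 17, "country": "DE", "email": "a@example.com"}
analyst_view = mask_record(row, "analyst")
auditor_view = mask_record(row, "auditor")
```

In this sketch the analyst sees the email column masked while the auditor sees the full row. Defaulting unknown roles to "mask everything" is a deliberate choice: a missing policy entry fails closed rather than open.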
Data lineage is defined as a data life cycle that includes the data's origins and where it moves over time....
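As a rough illustration of that definition, the sketch below records each movement of a dataset and walks the records backwards to find its origin. The class names, dataset names, and the assumption that every dataset has a single upstream source are all hypothetical simplifications, not how any particular lineage tool works.

```python
from dataclasses import dataclass, field


@dataclass
class LineageEvent:
    source: str      # where the data came from
    target: str      # where the data was written
    operation: str   # what was done on the way, e.g. "ingest"


@dataclass
class Lineage:
    events: list = field(default_factory=list)

    def record(self, source: str, target: str, operation: str) -> None:
        """Store one movement of data from source to target."""
        self.events.append(LineageEvent(source, target, operation))

    def trace(self, dataset: str) -> list:
        """Return the path from the origin to the given dataset."""
        # Assumes one upstream source per target (a simplification).
        parents = {e.target: e.source for e in self.events}
        path = [dataset]
        seen = {dataset}
        while path[-1] in parents and parents[path[-1]] not in seen:
            path.append(parents[path[-1]])
            seen.add(path[-1])  # guard against cycles
        return list(reversed(path))


lineage = Lineage()
lineage.record("crm.orders", "raw/orders", "ingest")
lineage.record("raw/orders", "curated/orders", "deduplicate")
path = lineage.trace("curated/orders")
```

Here `path` lists the datasets from the original source system through to the curated copy, which is the traceability a business user would want to inspect.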