Summary
Data Vault 2.0 is designed to address the challenges of managing large, complex, and rapidly changing data environments. It is a hybrid approach that combines elements of 3NF and star schema and uses a standardized, repeatable design pattern that can be applied to any dataset, regardless of size or complexity.
Data Vault design begins by defining the business model and constructing the base layer, known as the Raw Vault. The Raw Vault contains the following elements:
- Hubs – natural keys that identify business entities
- Links – store the interactions between business entities
- Satellites – store the descriptions and attributes of business entities
- Reference tables – include descriptive information and metadata
On top of the Raw Vault, a Business Vault is constructed to meet changing business needs and requirements without disrupting the overall data architecture. Next, domain-oriented information marts are built to meet...