Search icon CANCEL
Subscription
0
Cart icon
Cart
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Data Modeling for Azure Data Services

You're reading from  Data Modeling for Azure Data Services

Product type Book
Published in Jul 2021
Publisher Packt
ISBN-13 9781801077347
Pages 428 pages
Edition 1st Edition
Languages
Concepts
Author (1):
Peter ter Braake Peter ter Braake
Profile icon Peter ter Braake
Toc

Table of Contents (16) Chapters close

Preface 1. Section 1 – Operational/OLTP Databases
2. Chapter 1: Introduction to Databases 3. Chapter 2: Entity Analysis 4. Chapter 3: Normalizing Data 5. Chapter 4: Provisioning and Implementing an Azure SQL DB 6. Chapter 5: Designing a NoSQL Database 7. Chapter 6: Provisioning and Implementing an Azure Cosmos DB Database 8. Section 2 – Analytics with a Data Lake and Data Warehouse
9. Chapter 7: Dimensional Modeling 10. Chapter 8: Provisioning and Implementing an Azure Synapse SQL Pool 11. Chapter 9: Data Vault Modeling 12. Chapter 10: Designing and Implementing a Data Lake Using Azure Storage 13. Section 3 – ETL with Azure Data Factory
14. Chapter 11: Implementing ETL Using Azure Data Factory 15. Other Books You May Enjoy

Using hash keys

Hash keys were introduced in Data Vault 2.0 and play a central role in the design. One advantage of using hash keys is that both Hubs and Links can (and often will) have a composite key. This makes the keys large and inefficient. By creating a single hash value, the key becomes more efficient. This is not true for Hubs with small keys. In that case, hash keys are likely to be more inefficient. But there are other arguments. One other argument is that we want all tables to have the same structure. So, we also use a hash key, even when that is less efficient.

The most important advantage of using hash keys is the efficiency gain of the load process. To explain this, consider a star schema with surrogate keys. You need to load the dimension tables first. The surrogate keys are created during the insert of the new dimension rows. After all the dimensions have finished loading, you can start to load the fact table. The ETL process, which gets fact rows with source keys...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at ₹800/month. Cancel anytime