Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Arrow up icon
GO TO TOP
Data Engineering Best Practices

You're reading from   Data Engineering Best Practices Architect robust and cost-effective data solutions in the cloud era

Arrow left icon
Product type Paperback
Published in Oct 2024
Publisher Packt
ISBN-13 9781803244983
Length 550 pages
Edition 1st Edition
Languages
Arrow right icon
Authors (2):
Arrow left icon
David Larochelle David Larochelle
Author Profile Icon David Larochelle
David Larochelle
Richard J. Schiller Richard J. Schiller
Author Profile Icon Richard J. Schiller
Richard J. Schiller
Arrow right icon
View More author details
Toc

Table of Contents (21) Chapters Close

Preface 1. Chapter 1: Overview of the Business Problem Statement FREE CHAPTER 2. Chapter 2: A Data Engineer’s Journey – Background Challenges 3. Chapter 3: A Data Engineer’s Journey – IT’s Vision and Mission 4. Chapter 4: Architecture Principles 5. Chapter 5: Architecture Framework – Conceptual Architecture Best Practices 6. Chapter 6: Architecture Framework – Logical Architecture Best Practices 7. Chapter 7: Architecture Framework – Physical Architecture Best Practices 8. Chapter 8: Software Engineering Best Practice Considerations 9. Chapter 9: Key Considerations for Agile SDLC Best Practices 10. Chapter 10: Key Considerations for Quality Testing Best Practices 11. Chapter 11: Key Considerations for IT Operational Service Best Practices 12. Chapter 12: Key Considerations for Data Service Best Practices 13. Chapter 13: Key Considerations for Management Best Practices 14. Chapter 14: Key Considerations for Data Delivery Best Practices 15. Chapter 15: Other Considerations – Measures, Calculations, Restatements, and Data Science Best Practices 16. Chapter 16: Machine Learning Pipeline Best Practices and Processes 17. Chapter 17: Takeaway Summary – Putting It All Together 18. Chapter 18: Appendix and Use Cases 19. Index 20. Other Books You May Enjoy

What this book covers

  • Chapter 1, Overview of the Business Problem Statement, provides a definition of the business problem faced by the data engineer. It also provides an introduction to the entire book.
  • Chapter 2, A Data Engineer’s Journey – Background Challenges, elaborates on the challenges faced when building a modern data system.
  • Chapter 3, A Data Engineer’s Journey – IT’s Vision and Mission, illustrates various mission and vision statements and urges you to develop one if one does not already exist. This way, you can keep your focus on the end and not deviate from your strategy.
  • Chapter 4, Architecture Principles, elaborates on the need to develop principles that keep you solidly grounded in reality. Many examples are provided and explained because they drive the best practices.
  • Chapter 5, Architecture Framework – Conceptual Architecture Best Practices, depicts architecture as the framework for design engineering. Too often projects go off the rails because the architecture shifts and the structure of the engineering design falls apart. Architecture is a communication tool to keep consensus, especially when things go wrong – and they always do in any engineering effort.
  • Chapter 6, Architecture Framework – Logical Architecture Best Practices, describes the need to formally define and document the architecture for all, thus tying the conceptual level to the physical level of the architecture.
  • Chapter 7, Architecture Framework – Physical Architecture Best Practices, defines what will be built and eventually what was built and where it all operates.
  • Chapter 8, Software Engineering Best Practice Considerations, elaborates on the software best practices needed for the data engineering effort to succeed.
  • Chapter 9, Key Considerations for Agile SDLC Best Practices, discusses the project management and development processes needed to deliver a data solution.
  • Chapter 10, Key Considerations for Quality Testing Best Practices, provides testing best practices for a data factory.
  • Chapter 11, Key Considerations for IT Operational Service Best Practices, defines operational requirements for a data solution.
  • Chapter 12, Key Considerations for Data Service Best Practices, elaborates on data services, where the focus is on refining raw data into a gem, like a diamond, with facets. It takes the focus away from servicing data as a blob. Examples are provided to illustrate this important message.
  • Chapter 13, Key Considerations for Management Best Practices, gets into the details of data factory curation and processing with a focus on difficult problems to solve.
  • Chapter 14, Key Considerations for Data Delivery Best Practices, continues Chapter 13’s theme but addresses difficult problem areas for a business and the impediments that can be overcome with the best practices presented.
  • Chapter 15, Other Considerations – Measures, Calculations, Restatements and Data Science Best Practices, defines the analysis workbench and various tools and processes for the data consumer. This is what is necessary to deliver data at the end of the data factory.
  • Chapter 16, Machine Learning Pipeline Best Practices and Processes, dives deeper into machine learning/deep learning, Generative AI (GenAI), and ways to apply knowledge engineering to cooperatively address the future vision where AI takes center stage.
  • Chapter 17, Takeaway Summary – Putting It All Together, presents the book’s conclusion and parting wishes for the development of your future-proof data engineering designs.
  • Chapter 18, Appendix and Use Cases, delivers on the promise to elaborate on a few high-level use cases with a primer on the technologies used in those use cases.
lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image