Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Engineering Data Mesh in Azure Cloud
Engineering Data Mesh in Azure Cloud

Engineering Data Mesh in Azure Cloud: Implement data mesh using Microsoft Azure's Cloud Adoption Framework

eBook
$27.98 $39.99
Paperback
$49.99
Subscription
Free Trial
Renews at $19.99p/m

What do you get with a Packt Subscription?

Free for first 7 days. $19.99 p/m after that. Cancel any time!
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing
Table of content icon View table of contents Preview book icon Preview Book

Engineering Data Mesh in Azure Cloud

Introducing Data Meshes

Before we start designing and implementing a data mesh architecture, it is important to understand Why consider a data mesh? This chapter briefly walks through the history of business intelligence (BI) and analytics. We will go through the events and transitions of how analytics has evolved over the last few decades and the current challenges that make a data mesh architecture an alternative to traditional centralized analytical systems.

In this chapter, we’re going to cover the following main topics:

  • Exploring the evolution of modern data analytics
  • Discovering the challenges of modern-day enterprises
  • Data as a product (DaaP)
  • Data domains
  • The data mesh solution

Exploring the evolution of modern data analytics

After the advent of databases in the late 1970s and early 1980s, databases were treated as a central source of truth (SOT) and designed to record transactions and produce daily, weekly, and monthly financial reports. These are largely termed online transaction processing (OLTP) systems.

In the late 1980s, businesses felt the need to understand how their business was performing and investigate any changes to sales, production, revenue, or any other important aspects of the business so that they could run their businesses more efficiently. But in order to conduct this investigation, they had to run complex queries across all tables in their database and be able to slice and dice the data to dig deeper into it. They also had to aggregate values in order to find totals and averages across a period of time. A relational model that spread data across multiple tables was needed to aggregate and join data across these tables. As a result...

Discovering the challenges of modern-day enterprises

As enterprises get bigger, the ability to be agile and competitive becomes challenging. Some enterprises have large departments for sales, marketing, engineering, and so on. Many large corporations split their global businesses into regions or zones. Each of these sub-organizations operates as an independent unit. They have their own business complexity, analytics needs, and speed at which analytical output is required. They choose their tools according to their requirements. But at the end of the day, they are asked to move their data to a central lakehouse or data warehouse for enterprise-wide analytics, which uses a specific set of tools and mandates a certain data format. This strategy has multiple challenges:

  • The sub-organizations are forced to use the tools in the central analytical system to perform their analytics.
  • The sub-organizations then start building their own local analytical platforms to speed up their...

DaaP

Historically, data has always been treated as a backend. It was used by the middle tier and then surfaced to the frontend. Applications did not do a lot with the data other than aggregating and presenting it with better visuals. Relational database systems also ensured that data adhered to a schema and format and that all mandatory fields were populated. As a result, applications received quality data and had to do minimal checks on quality. But with semi-structured data, this equation changes. Semi-structured data does not comply with fixed schemas and rules of how data is formatted and populated. Advanced analytics, ML, and big data analytics need a lot of processing on the data before it’s consumed by any algorithm and application. ML algorithms provide exponentially accurate output as the volume of quality data increases.

In a paper published in 2001 (https://homl.info/6), Microsoft researchers Michele Banko and Erik Brill showed that different ML algorithms performed...

Data domains

One of the concepts defined in https://www.thoughtworks.com/what-we-do/data-and-ai/data-mesh is the concept of data domains. Data domains are defined as a logical grouping of data and teams aligned with certain business domains, such as sales, marketing, or production. While each of these domains may have multiple data products, all the teams and the data used to build these products fall under the same domain. This domain team is responsible for managing and maintaining the data in their domain. This is described as domain ownership.

However, in reality, we have found that adopting a domain as a concept could be challenging for many companies as every company has its own structure. For example, large global companies that run their business in different geological zones have sales, marketing, production, and local finance departments in every location. Each of these departments works independently based on their local market and country requirements. Clubbing all sales...

The data mesh solution

As we learned in the Discovering the challenges of modern-day enterprises section, having a central data lake or data warehouse has several disadvantages, especially for large organizations. In the previous section, we learned about changes in data processing requirements driven by ML and advanced analytics and how data now needs to be treated like a product with its own complete life cycle.

To explain a data mesh in one sentence, a data mesh is a centrally managed network of decentralized data products. The data mesh breaks the central data lake into decentralized islands of data that are owned by the teams that generate the data. The data mesh architecture proposes that data be treated like a product, with each team producing its own data/output using its own choice of tools arranged in an architecture that works for them. This team completely owns the data/output they produce and exposes it for others to consume in a way they deem fit for their data. Here...

Summary

We saw in this chapter how data analytics evolved over time as technology advanced and as business needs changed. One of the main objectives of walking through this history is to realize that, once again, we are at the cusp of a change. Data-driven organizations are putting pressure on data products to deliver faster innovation to keep the company ahead of the competitive curve. We also saw how data preprocessing has become critical to modern-day analytics, which uses machine learning for accurate predictions and forecasting. Clean, curated data itself becomes like a product that other products can consume to get innovative insights. This drives the need for a more collaborative and agile analytical environment where data can be discovered and used to build data products, as opposed to the centralized dashboards and reports of the past. A data mesh is one of the ways to bring this agile and collaborative framework to life.

However, a data mesh is a long-term strategy and...

Left arrow icon Right arrow icon
Download code icon Download Code

Key benefits

  • Delve into core data mesh concepts and apply them to real-world situations
  • Safely reassess and redesign your framework for seamless data mesh integration
  • Conquer practical challenges, from domain organization to building data contracts
  • Purchase of the print or Kindle book includes a free PDF eBook

Description

Decentralizing data and centralizing governance are practical, scalable, and modern approaches to data analytics. However, implementing a data mesh can feel like changing the engine of a moving car. Most organizations struggle to start and get caught up in the concept of data domains, spending months trying to organize domains. This is where Engineering Data Mesh in Azure Cloud can help. The book starts by assessing your existing framework before helping you architect a practical design. As you progress, you’ll focus on the Microsoft Cloud Adoption Framework for Azure and the cloud-scale analytics framework, which will help you quickly set up a landing zone for your data mesh in the cloud. The book also resolves common challenges related to the adoption and implementation of a data mesh faced by real customers. It touches on the concepts of data contracts and helps you build practical data contracts that work for your organization. The last part of the book covers some common architecture patterns used for modern analytics frameworks such as artificial intelligence (AI). By the end of this book, you’ll be able to transform existing analytics frameworks into a streamlined data mesh using Microsoft Azure, thereby navigating challenges and implementing advanced architecture patterns for modern analytics workloads.

Who is this book for?

This book is for chief data officers and data architects of large and medium-size organizations who are struggling to maintain silos of data and analytics projects. Data architects and data engineers looking to understand data mesh and how it can help their organizations democratize data and analytics will also benefit from this book. Prior knowledge of managing centralized analytical systems, as well as experience with building data lakes, data warehouses, data pipelines, data integrations, and transformations is needed to get the most out of this book.

What you will learn

  • Build a strategy to implement a data mesh in Azure Cloud
  • Plan your data mesh journey to build a collaborative analytics platform
  • Address challenges in designing, building, and managing data contracts
  • Get to grips with monitoring and governing a data mesh
  • Understand how to build a self-service portal for analytics
  • Design and implement a secure data mesh architecture
  • Resolve practical challenges related to data mesh adoption

Product Details

Country selected
Publication date, Length, Edition, Language, ISBN-13
Publication date : Mar 29, 2024
Length: 314 pages
Edition : 1st
Language : English
ISBN-13 : 9781805120780
Category :
Languages :
Concepts :
Tools :

What do you get with a Packt Subscription?

Free for first 7 days. $19.99 p/m after that. Cancel any time!
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing

Product Details

Publication date : Mar 29, 2024
Length: 314 pages
Edition : 1st
Language : English
ISBN-13 : 9781805120780
Category :
Languages :
Concepts :
Tools :

Packt Subscriptions

See our plans and pricing
Modal Close icon
$19.99 billed monthly
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Simple pricing, no contract
$199.99 billed annually
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just $5 each
Feature tick icon Exclusive print discounts
$279.99 billed in 18 months
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just $5 each
Feature tick icon Exclusive print discounts

Frequently bought together


Stars icon
Total $ 141.97
Engineering Data Mesh in Azure Cloud
$49.99
Solutions Architect's Handbook
$59.99
Principles of Data Fabric
$31.99
Total $ 141.97 Stars icon

Table of Contents

22 Chapters
Part 1: Rolling Out the Data Mesh in the Azure Cloud Chevron down icon Chevron up icon
Chapter 1: Introducing Data Meshes Chevron down icon Chevron up icon
Chapter 2: Building a Data Mesh Strategy Chevron down icon Chevron up icon
Chapter 3: Deploying a Data Mesh Using the Azure Cloud-Scale Analytics Framework Chevron down icon Chevron up icon
Chapter 4: Building a Data Mesh Governance Framework Using Microsoft Azure Services Chevron down icon Chevron up icon
Chapter 5: Security Architecture for Data Meshes Chevron down icon Chevron up icon
Chapter 6: Automating Deployment through Azure Resource Manager and Azure DevOps Chevron down icon Chevron up icon
Chapter 7: Building a Self-Service Portal for Common Data Mesh Operations Chevron down icon Chevron up icon
Part 2: Practical Challenges of Implementing a Data Mesh Chevron down icon Chevron up icon
Chapter 8: How to Design, Build, and Manage Data Contracts Chevron down icon Chevron up icon
Chapter 9: Data Quality Management Chevron down icon Chevron up icon
Chapter 10: Master Data Management Chevron down icon Chevron up icon
Chapter 11: Monitoring and Data Observability Chevron down icon Chevron up icon
Chapter 12: Monitoring Data Mesh Costs and Building a Cross-Charging Model Chevron down icon Chevron up icon
Chapter 13: Understanding Data-Sharing Topologies in a Data Mesh Chevron down icon Chevron up icon
Part 3: Popular Data Product Architectures Chevron down icon Chevron up icon
Chapter 14: Advanced Analytics Using Azure Machine Learning, Databricks, and the Lakehouse Architecture Chevron down icon Chevron up icon
Chapter 15: Big Data Analytics Using Azure Synapse Analytics Chevron down icon Chevron up icon
Chapter 16: Event-Driven Analytics Using Azure Event Hubs, Azure Stream Analytics, and Azure Machine Learning Chevron down icon Chevron up icon
Chapter 17: AI Using Azure Cognitive Services and Azure OpenAI Chevron down icon Chevron up icon
Index Chevron down icon Chevron up icon
Other Books You May Enjoy Chevron down icon Chevron up icon

Customer reviews

Top Reviews
Rating distribution
Full star icon Full star icon Full star icon Full star icon Half star icon 4.5
(6 Ratings)
5 star 50%
4 star 50%
3 star 0%
2 star 0%
1 star 0%
Filter icon Filter
Top Reviews

Filter reviews by




S.Kundu Apr 15, 2024
Full star icon Full star icon Full star icon Full star icon Full star icon 5
Engineering Data Mesh in Azure Cloud helps you to implement Data Mesh approach using different Azure Services. This book provides all information on all challenges along with the solutions involved in implementing data mesh in Microsoft Azure. This book can be helpful to those who want to explore the idea of building a distributed analytical system to handle future growth and agility requirements.A few important topics of the book that I want to highlight are as below:The First part of the book starts with an introduction to Data Mesh architecture and the reason behind considering Data Mesh. It covers the evolution of Data Architecture over time and different challenges of modern day enterprises and how Data Mesh can solve these problems. It also covers different approaches to build your Data Mesh architecture.The book will explain how to deploy a Data Mesh using Azure Cloud Scale Analytics Framework along with building a Data Governance Framework using Microsoft Azure Services.Then it will focus on Security Architecture for Data Mesh and automating the deployment through ARM and Azure DevOps. It will also help you to build a Self-Service Portal for Common Data Mesh Operations.In the next part it will cover the topics of how you can design, build and manage Data Contracts and how to link data contracts to data consumption or pipelines. It will explain you the Data Quality Management related topics along with popular data quality frameworks and tools and go through MDM design patterns and the MDM architecture for a data mesh.It will then cover the Data Mesh and data observability part along with monitoring Data Mesh costs. It will go through different data sharing challenges in Data Mesh along with exploring different methods available for sharing data.The final part covers different popular Data Product Architectures where it will teach you Advanced Analytics using Azure Machine Learning, Databricks and Lakehouse Architecture. It will also teach you how to work on Big Data Analytics using Azure Synapse Analytics along with Event Driven Analytics using Azure Event Hubs, Azure Stream Analytics. It will cover the topics on AI using Azure Cognitive Services and Azure OpenAI.
Amazon Verified review Amazon
J. May 13, 2024
Full star icon Full star icon Full star icon Full star icon Full star icon 5
This book is compact and full of both conceptual and practical information. The author starts from zero, explaining data mesh, both the what and the why, and then proceeds to show how to implement one using Azure. Very well written. To the point.
Amazon Verified review Amazon
Steven Fernandes May 05, 2024
Full star icon Full star icon Full star icon Full star icon Full star icon 5
This book serves as a comprehensive guide for professionals looking to implement a data mesh on the Azure Cloud. It thoroughly covers the strategic planning required to build a collaborative analytics platform, guiding readers through the essential steps from inception to execution. Key topics include the design and management of data contracts, overcoming common challenges, and the intricacies of monitoring and governance. The book also delves into the technical aspects of creating a self-service analytics portal and designing a secure data mesh architecture. Moreover, it offers practical solutions for the challenges faced during data mesh adoption, making it an invaluable resource for anyone embarking on or managing such a project. This guide is well-structured, providing clear and actionable insights that are crucial for successfully implementing a data mesh in a cloud environment.
Amazon Verified review Amazon
Shanthababu Pandian Jun 24, 2024
Full star icon Full star icon Full star icon Full star icon Empty star icon 4
Data engineering is a pivotal function in the industry, and the emergence of data mesh has significantly transformed this practice. This book is a perfect resource, offering a comprehensive understanding of Data Mesh concepts with practical insights, empowering you to implement these concepts effectively in Azure Cloud.The book provides a comprehensive introduction to data meshes, guiding us through the process of building a data mesh strategy. It helps us evaluate our organization’s analytics maturity level, align our strategy with the business, and explore the role of data mesh architecture in this context.The author delves into the practical aspects of deploying data mesh using Azure Cloud-scale Analytics and building a data mesh governance framework using Microsoft services. This focus on Azure Cloud makes the book highly relevant and engaging for Azure Cloud professionals.Following this, the security architecture for data mesh is incredible. It covers various aspects of data security, such as access control and retention, and the security challenges associated with distributed data.Chapters on automating Deployment through Azure Resource Manager and Azure DevOps and Building a Self-Service Portal for Common Data Mesh Operations offer more than just theoretical knowledge. They provide practical insights and solutions that can be immediately implemented in real-world scenarios.These topics are highly recommended for professionals seeking to enhance their data engineering skills.The author, a seasoned expert in designing, building, and managing data contracts, brings a wealth of experience to the table. His deep understanding of data engineering and Azure Cloud services is evident in the formal methods and processes he discusses for maintaining data contracts and SLAs.This expertise builds trust and increases the reliability of the data mesh concepts presented in the book.Data Quality and Master Data Management concepts are key parameters in data.The author explores how, as data mesh grows, data products depend on each other for their outcomes. The explanation of how the bad data quality of one data product could impact multiple products is impressive.Monitoring and Data Observability are other vital concepts we must understand in the data world. He covers monitoring and data observability, which are crucial for data mesh as they enable real-time insights into health and performance.The author explored how to monitor data mesh costs and build a cross-charging model.Specific chapters to understanding data-sharing topologies in a Data Mesh are powerful content in this book, along with various data-sharing topologies, and the different scenarios for using each topology are described.He covered famous data product architecture using advanced analytics aspects by bringing Azure Machine Learning, Databricks, Big Data Analytics Using Azure Synapse Analytics, Event-Driven Analytics Using Azure Event Hubs, Azure Stream Analytics, Azure Machine Learning, AI Using Azure Cognitive Services, and Azure OpenAI.The author gives detailed instructions about data mesh in Azure cloud infrastructure, which is an amazing way to help Data Engineers understand and implement it precisely.Overall … I can give 4.5/5.0 for this. Indeed, an extraordinary effort from the author is much appreciated.-Shanthababu PandianAI and Data Architect | Scrum Master | National and International Speaker | Blogger | Author
Amazon Verified review Amazon
Advitya Gemawat May 14, 2024
Full star icon Full star icon Full star icon Full star icon Empty star icon 4
Some areas covered by the book that stood out to me:🔧 Strong intro on evolution of Data architectures and identifying a strategy to architect your Data Mesh🍱 Utilizing Azure for out-of-the-box Deployment, Governance, and Security capabilities🗃 Dense theory on managing intermediate Data contracts, data quality, & observability📈 My favorite: Practical examples to use AML, Azure Databricks, Azure Synapse, along with a dedicated chapter to connect documents & vector databases using Cognitive Services & Azure OpenAI for custom LLM applicationsThe one limitation I found with this book was the lack of mention about Microsoft Fabric, which is supposed to tie all these data products together in a unified experience. Chapters 14-17 on practical applications are quite useful to learn individual services, but the most relevant learning (and quickest onboarding) may come out of services like Fabric to manage the E2E lifecycle of creating, connection, and testing different services. I'm hoping a future version of this book may adapt Part 4 to center around the overarching experiences Fabric would unlock.I can see Data Architects / technical leads needing to create/maintain large silos of analytics projects directly benefit with Part 1 of the book to holistically analyze their data engineering requirements.Data Engineers will get the quickest benefit from getting hands-on with Azure, but the theory highlighted in Part 2 will perhaps be the most helpful for longer-term as they tackle more technically complex projects.
Amazon Verified review Amazon
Get free access to Packt library with over 7500+ books and video courses for 7 days!
Start Free Trial

FAQs

What is included in a Packt subscription? Chevron down icon Chevron up icon

A subscription provides you with full access to view all Packt and licnesed content online, this includes exclusive access to Early Access titles. Depending on the tier chosen you can also earn credits and discounts to use for owning content

How can I cancel my subscription? Chevron down icon Chevron up icon

To cancel your subscription with us simply go to the account page - found in the top right of the page or at https://subscription.packtpub.com/my-account/subscription - From here you will see the ‘cancel subscription’ button in the grey box with your subscription information in.

What are credits? Chevron down icon Chevron up icon

Credits can be earned from reading 40 section of any title within the payment cycle - a month starting from the day of subscription payment. You also earn a Credit every month if you subscribe to our annual or 18 month plans. Credits can be used to buy books DRM free, the same way that you would pay for a book. Your credits can be found in the subscription homepage - subscription.packtpub.com - clicking on ‘the my’ library dropdown and selecting ‘credits’.

What happens if an Early Access Course is cancelled? Chevron down icon Chevron up icon

Projects are rarely cancelled, but sometimes it's unavoidable. If an Early Access course is cancelled or excessively delayed, you can exchange your purchase for another course. For further details, please contact us here.

Where can I send feedback about an Early Access title? Chevron down icon Chevron up icon

If you have any feedback about the product you're reading, or Early Access in general, then please fill out a contact form here and we'll make sure the feedback gets to the right team. 

Can I download the code files for Early Access titles? Chevron down icon Chevron up icon

We try to ensure that all books in Early Access have code available to use, download, and fork on GitHub. This helps us be more agile in the development of the book, and helps keep the often changing code base of new versions and new technologies as up to date as possible. Unfortunately, however, there will be rare cases when it is not possible for us to have downloadable code samples available until publication.

When we publish the book, the code files will also be available to download from the Packt website.

How accurate is the publication date? Chevron down icon Chevron up icon

The publication date is as accurate as we can be at any point in the project. Unfortunately, delays can happen. Often those delays are out of our control, such as changes to the technology code base or delays in the tech release. We do our best to give you an accurate estimate of the publication date at any given time, and as more chapters are delivered, the more accurate the delivery date will become.

How will I know when new chapters are ready? Chevron down icon Chevron up icon

We'll let you know every time there has been an update to a course that you've bought in Early Access. You'll get an email to let you know there has been a new chapter, or a change to a previous chapter. The new chapters are automatically added to your account, so you can also check back there any time you're ready and download or read them online.

I am a Packt subscriber, do I get Early Access? Chevron down icon Chevron up icon

Yes, all Early Access content is fully available through your subscription. You will need to have a paid for or active trial subscription in order to access all titles.

How is Early Access delivered? Chevron down icon Chevron up icon

Early Access is currently only available as a PDF or through our online reader. As we make changes or add new chapters, the files in your Packt account will be updated so you can download them again or view them online immediately.

How do I buy Early Access content? Chevron down icon Chevron up icon

Early Access is a way of us getting our content to you quicker, but the method of buying the Early Access course is still the same. Just find the course you want to buy, go through the check-out steps, and you’ll get a confirmation email from us with information and a link to the relevant Early Access courses.

What is Early Access? Chevron down icon Chevron up icon

Keeping up to date with the latest technology is difficult; new versions, new frameworks, new techniques. This feature gives you a head-start to our content, as it's being created. With Early Access you'll receive each chapter as it's written, and get regular updates throughout the product's development, as well as the final course as soon as it's ready.We created Early Access as a means of giving you the information you need, as soon as it's available. As we go through the process of developing a course, 99% of it can be ready but we can't publish until that last 1% falls in to place. Early Access helps to unlock the potential of our content early, to help you start your learning when you need it most. You not only get access to every chapter as it's delivered, edited, and updated, but you'll also get the finalized, DRM-free product to download in any format you want when it's published. As a member of Packt, you'll also be eligible for our exclusive offers, including a free course every day, and discounts on new and popular titles.