You're reading from Azure Data Engineering Cookbook Get well versed in various data engineering techniques in Azure using this recipe-based guide

Product type Paperback

Published in Sep 2022

Publisher Packt

ISBN-13 9781803246789

Length 608 pages

Edition 2nd Edition

Languages

SQL

Tools

Azure

Concepts

Data Engineering

Authors (3):

Ahmad Osama

Nagaraj Venkatesan

Luca Zanna

View More author details

Table of Contents (16) Chapters

Preface

1. Chapter 1: Creating and Managing Data in Azure Data Lake

2. Chapter 2: Securing and Monitoring Data in Azure Data Lake FREE CHAPTER

3. Chapter 3: Building Data Ingestion Pipelines Using Azure Data Factory

4. Chapter 4: Azure Data Factory Integration Runtime

5. Chapter 5: Configuring and Securing Azure SQL Database

6. Chapter 6: Implementing High Availability and Monitoring in Azure SQL Database

7. Chapter 7: Processing Data Using Azure Databricks

8. Chapter 8: Processing Data Using Azure Synapse Analytics

9. Chapter 9: Transforming Data Using Azure Synapse Dataflows

10. Chapter 10: Building the Serving Layer in Azure Synapse SQL Pool

11. Chapter 11: Monitoring Synapse SQL and Spark Pools

12. Chapter 12: Optimizing and Maintaining Synapse SQL and Spark Pools

13. Chapter 13: Monitoring and Maintaining Azure Data Engineering Pipelines

14. Index

Why subscribe?

15. Other Books You May Enjoy

Optimizing query performance in Synapse Spark pools

There are several methods you can use to optimize the performance of queries in a lake database, such as caching, indexing, partitioning, Z-ordering, data skipping, and using query hints. This recipe will showcase the following two methods to optimize the performance of a query:

Z-ordering: Z-ordering helps the Spark engine easily locate columns with the same value
Partitioning: Partitioning will partition the Delta lake table into smaller chunks, creating subfolders in the data lake storage account for each distinct value on the partitioned column

Getting ready

To get started, log into https://portal.azure.com using your Azure credentials.

Create a Synapse Analytics workspace, as explained in the Provisioning an Azure Synapse Analytics workspace recipe of Chapter 8, Processing Data Using Azure Synapse Analytics.

Create a Spark pool cluster, as explained in the Provisioning and configuring Spark pools recipe...

The rest of the chapter is locked

You're reading from Azure Data Engineering Cookbook Get well versed in various data engineering techniques in Azure using this recipe-based guide

Table of Contents (16) Chapters

Optimizing query performance in Synapse Spark pools

Getting ready

Authors (2)

Personalised recommendations for you

You're reading from Azure Data Engineering Cookbook Get well versed in various data engineering techniques in Azure using this recipe-based guide

Table of Contents (16) Chapters

Optimizing query performance in Synapse Spark pools

Getting ready

Unlock this book and the full library FREE for 7 days

Authors (2)

Personalised recommendations for you