Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Limitless Analytics with Azure Synapse

You're reading from   Limitless Analytics with Azure Synapse An end-to-end analytics service for data processing, management, and ingestion for BI and ML

Arrow left icon
Product type Paperback
Published in Jun 2021
Publisher Packt
ISBN-13 9781800205659
Length 392 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Authors (2):
Arrow left icon
Saranya Ravichander Saranya Ravichander
Author Profile Icon Saranya Ravichander
Saranya Ravichander
Prashant Kumar Mishra Prashant Kumar Mishra
Author Profile Icon Prashant Kumar Mishra
Prashant Kumar Mishra
Arrow right icon
View More author details
Toc

Table of Contents (20) Chapters Close

Preface 1. Section 1: The Basics and Key Concepts
2. Chapter 1: Introduction to Azure Synapse FREE CHAPTER 3. Chapter 2: Considerations for Your Compute Environment 4. Section 2: Data Ingestion and Orchestration
5. Chapter 3: Bringing Your Data to Azure Synapse 6. Chapter 4: Using Synapse Pipelines to Orchestrate Your Data 7. Chapter 5: Using Synapse Link with Azure Cosmos DB 8. Section 3: Azure Synapse for Data Scientists and Business Analysts
9. Chapter 6: Working with T-SQL in Azure Synapse 10. Chapter 7: Working with R, Python, Scala, .NET, and Spark SQL in Azure Synapse 11. Chapter 8: Integrating a Power BI Workspace with Azure Synapse 12. Chapter 9: Perform Real-Time Analytics on Streaming Data 13. Chapter 10: Generate Powerful Insights on Azure Synapse Using Azure ML 14. Section 4: Best Practices
15. Chapter 11: Performing Backup and Restore in Azure Synapse Analytics 16. Chapter 12: Securing Data on Azure Synapse 17. Chapter 13: Managing and Monitoring Synapse Workloads 18. Chapter 14: Coding Best Practices 19. Other Books You May Enjoy

Implementing best practices for a Synapse Spark pool

As with Synapse SQL pools, it is also important to keep our Spark pool healthy. In this section, we are going to learn how to optimize cluster configuration for any particular workload. We will also learn how to use various techniques for enhancing Apache Spark performance.

Configuring the Auto-pause setting

There are some major advantages of using Platform-as-a-Service (PaaS) instead of an on-premises environment, and the Auto-pause setting is one of the best features that PaaS has to offer. If you are running a Spark cluster on your on-premises environment, you need to pay for provisioning it even though you may only need to use this cluster for a couple of hours a day. However, Synapse gives you the option to configure the Auto-pause setting to pause a cluster automatically if not in use. Upon entering a value for the Number of minutes idle field within the Auto-pause setting, the Spark pool will go to a Pause state automatically...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at €18.99/month. Cancel anytime