Limitless Analytics with Azure Synapse

You're reading from Limitless Analytics with Azure Synapse An end-to-end analytics service for data processing, management, and ingestion for BI and ML

Product type Paperback

Published in Jun 2021

Publisher Packt

ISBN-13 9781800205659

Length 392 pages

Edition 1st Edition

Languages

Python

Tools

Azure

Concepts

Data Processing

Authors (2):

Saranya Ravichander

Prashant Kumar Mishra

Table of Contents (20) Chapters

Preface

1. Section 1: The Basics and Key Concepts

2. Chapter 1: Introduction to Azure Synapse

3. Chapter 2: Considerations for Your Compute Environment

4. Section 2: Data Ingestion and Orchestration

5. Chapter 3: Bringing Your Data to Azure Synapse

6. Chapter 4: Using Synapse Pipelines to Orchestrate Your Data

7. Chapter 5: Using Synapse Link with Azure Cosmos DB

8. Section 3: Azure Synapse for Data Scientists and Business Analysts

9. Chapter 6: Working with T-SQL in Azure Synapse

10. Chapter 7: Working with R, Python, Scala, .NET, and Spark SQL in Azure Synapse

11. Chapter 8: Integrating a Power BI Workspace with Azure Synapse

12. Chapter 9: Perform Real-Time Analytics on Streaming Data

13. Chapter 10: Generate Powerful Insights on Azure Synapse Using Azure ML

14. Section 4: Best Practices

15. Chapter 11: Performing Backup and Restore in Azure Synapse Analytics

16. Chapter 12: Securing Data on Azure Synapse

17. Chapter 13: Managing and Monitoring Synapse Workloads

18. Chapter 14: Coding Best Practices

19. Other Books You May Enjoy

Defining source and target datasets

Datasets are created for a pipeline to identify data stored in various data sources and in different formats, such as tables, files, folders, and documents. A single dataset can be used by multiple activities or pipelines.
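
To make the reuse point concrete, here is a minimal Python sketch in which two activities point at the same dataset by name. It assumes a simplified version of the reference-by-name shape that pipeline definitions use; the dataset and activity names are hypothetical, and the activity dictionaries are flattened for readability rather than copied from a real pipeline.

```python
# Simplified, illustrative shapes only: every name below is hypothetical.

def dataset_ref(name: str) -> dict:
    """Build a by-name reference to a dataset."""
    return {"referenceName": name, "type": "DatasetReference"}


SOURCE = "SourceCsvDataset"  # hypothetical dataset name

# Two activities that both rely on the same source dataset.
activities = [
    {
        "name": "CopyToStaging",
        "type": "Copy",
        "inputs": [dataset_ref(SOURCE)],
        "outputs": [dataset_ref("StagingDataset")],
    },
    {
        "name": "CheckSourceExists",
        "type": "GetMetadata",
        "dataset": dataset_ref(SOURCE),
    },
]


def activities_using(dataset_name: str, activity_list: list) -> list:
    """Return the names of the activities that reference the given dataset."""
    names = []
    for act in activity_list:
        # Collect every dataset reference the activity carries.
        refs = act.get("inputs", []) + act.get("outputs", [])
        if "dataset" in act:
            refs.append(act["dataset"])
        if any(ref["referenceName"] == dataset_name for ref in refs):
            names.append(act["name"])
    return names


users_of_source = activities_using(SOURCE, activities)
print(users_of_source)
```

Because activities hold only a reference, changing the dataset definition once affects every activity that names it.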

Before we start applying transformations to the data, the required datasets should be in place. Follow these instructions to create a dataset for the source:

1. Go to the Data tab in Synapse Studio and click on + on the Data canvas, as highlighted in the following screenshot:

Figure 4.12 – Creating a dataset in Synapse Studio

2. Select Integration dataset from the dropdown, and then select the required data store from the list of available data stores in the Integration dataset window. In this example, we select Azure Data Lake Storage Gen2 as our data store and then click on Continue.
3. Select the DelimitedText format for your data from the list of all available options...
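
As a rough picture of what choices like these amount to, the following Python sketch builds a dataset definition as a plain dictionary and prints it as JSON. The property names follow the JSON shape commonly shown for DelimitedText datasets on Azure Data Lake Storage Gen2, which is an assumption here since the excerpt does not show the definition. Every value (dataset name, linked service, file system, folder, file) is hypothetical, and the remaining steps of the walkthrough are not visible in this excerpt.

```python
import json


def build_delimited_text_dataset(name, linked_service, file_system,
                                 folder_path, file_name,
                                 column_delimiter=",",
                                 first_row_as_header=True):
    """Return an illustrative dataset definition as a plain dict."""
    return {
        "name": name,
        "properties": {
            # The linked service holds the connection to the data store.
            "linkedServiceName": {
                "referenceName": linked_service,
                "type": "LinkedServiceReference",
            },
            # Format chosen in the format-selection step.
            "type": "DelimitedText",
            "typeProperties": {
                # Location type assumed for Azure Data Lake Storage Gen2.
                "location": {
                    "type": "AzureBlobFSLocation",
                    "fileSystem": file_system,
                    "folderPath": folder_path,
                    "fileName": file_name,
                },
                "columnDelimiter": column_delimiter,
                "firstRowAsHeader": first_row_as_header,
            },
            # Left empty here; a schema can be imported or defined later.
            "schema": [],
        },
    }


# All argument values below are hypothetical placeholders.
source_dataset = build_delimited_text_dataset(
    name="SourceCsvDataset",
    linked_service="MyDataLakeLinkedService",
    file_system="raw",
    folder_path="sales/2021",
    file_name="orders.csv",
)

print(json.dumps(source_dataset, indent=2))
```

Keeping the definition as data makes it easy to compare against what Synapse Studio generates for your own dataset.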

Authors (2)

Saranya Ravichander

Saranya Ravichander has more than 10 years of experience in the IT industry with more than 7 years at Microsoft. Currently, she is a senior cloud solution architect at Microsoft. She specializes in enterprise application management with a core competency in Microsoft Azure.

Prashant Kumar Mishra

Prashant Kumar Mishra is an engineering architect at Microsoft. He has more than 10 years of professional expertise in the Microsoft data and AI segment as a developer, consultant, and architect. He has been focused on Microsoft Azure Cloud technologies for several years now and has helped various customers in their data journey. He prefers to share his knowledge with others to make the data community stronger day by day through his blogs and meetup groups.
