Connecting Azure Data Lake to Azure Data Factory and loading data
Moving data is one of the typical tasks done by data engineers. In this recipe, we will be connecting Azure Data Factory to external storage (Azure Blob Storage) and moving the Chicago Safety Data
dataset to Azure Data Lake Gen2 that we set up in the previous recipe.
Getting ready
Make sure you have set up Azure Data Lake Gen2 in the Setting up Azure Data Lake Storage Gen 2 recipe.
The dataset that we are going to use in this recipe, Chicago Safety Data
, is stored here: https://azure.microsoft.com/en-us/services/open-datasets/catalog/chicago-safety-data/. This dataset is published as a part of Azure Open Datasets, which is built to distribute data.
How to do it...
To transfer the dataset from Azure Blob storage to Azure Data Lake Gen2 with Data Factory, first, let's go to the Azure Data Factory UI:
- Click + and select Copy Data tool as shown in the following screenshot: