Components
Reading Figure 14.1 from left to right, let's look at each component and what it does.
Source data
Source data can be document repositories, social media feeds, images, or voice recordings. Semi-structured data, such as customer profiles or product details, is typically stored in comma-separated values (CSV) files or in JSON format.
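To make the CSV and JSON formats concrete, here is a minimal sketch that parses the same customer profiles from both using only the Python standard library. The sample records, field names, and function names are hypothetical and chosen for illustration only; they are not part of the architecture in the figure.

```python
import csv
import io
import json

# Hypothetical sample data: the same customer profiles in CSV and JSON form.
CSV_TEXT = "customer_id,name,city\n1,Ada,London\n2,Lin,Taipei\n"
JSON_TEXT = (
    '[{"customer_id": 1, "name": "Ada", "city": "London"},'
    ' {"customer_id": 2, "name": "Lin", "city": "Taipei"}]'
)


def load_csv_profiles(text):
    """Parse CSV text into a list of dicts, converting the ID column to int."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        row["customer_id"] = int(row["customer_id"])  # CSV values arrive as strings
        rows.append(dict(row))
    return rows


def load_json_profiles(text):
    """Parse a JSON array of objects; JSON already carries numeric types."""
    return json.loads(text)


csv_profiles = load_csv_profiles(CSV_TEXT)
json_profiles = load_json_profiles(JSON_TEXT)

# Both formats yield the same records once the CSV types are converted.
print(csv_profiles == json_profiles)
```

The difference worth noticing is that CSV carries no type information, so every value is read as a string and must be converted, whereas JSON distinguishes numbers from strings in the file itself.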
Azure Data Factory
Azure Data Factory is a cloud-scale extract, transform, and load (ETL) service. It provides ready-made connectors to over a hundred different sources, including SAP, Oracle, SQL Server, and many other enterprise systems and data stores. Its connectors also extend beyond Azure to other cloud platforms such as Google Cloud Platform (GCP) and Amazon Web Services (AWS).
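The extract, transform, and load pattern itself can be sketched in a few lines of plain Python. This is not Azure Data Factory code and does not use its connectors or APIs; the source rows, table name, and function names are hypothetical stand-ins, with an in-memory SQLite database playing the role of the destination data store.

```python
import sqlite3

# Hypothetical source rows, standing in for data pulled from a source system.
SOURCE_ROWS = [
    {"product_id": "P1", "name": " widget ", "price": "9.50"},
    {"product_id": "P2", "name": "Gadget", "price": "20"},
]


def extract():
    """Extract: read raw rows from the (stand-in) source."""
    return list(SOURCE_ROWS)


def transform(rows):
    """Transform: trim and title-case names, convert prices to floats."""
    return [
        (r["product_id"], r["name"].strip().title(), float(r["price"]))
        for r in rows
    ]


def load(conn, records):
    """Load: write the cleaned records into the destination table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS products "
        "(product_id TEXT PRIMARY KEY, name TEXT, price REAL)"
    )
    conn.executemany("INSERT INTO products VALUES (?, ?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")  # in-memory database as a stand-in sink
load(conn, transform(extract()))
loaded = conn.execute(
    "SELECT product_id, name, price FROM products ORDER BY product_id"
).fetchall()
print(loaded)
```

A managed service takes over the parts this sketch hard-codes: connecting to the real source and destination systems, scheduling the run, and scaling it to large volumes.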
The following are some resources that can help you learn more about Azure Data Factory and its connectors:
- Azure Data Factory documentation: https://learn.microsoft.com/en-us/azure/data-factory...