Creating Data Pipelines
Data pipelines are a collection of various data-processing activities arranged in a particular sequence to produce the desired insights from raw data. You have already seen many examples in Azure Data Factory in various sections, such as Designing and Implementing Incremental Loads, Transform Options Available in ADF, and ADF Templates (Chapter 4, Ingesting and Transforming Data), where you chain the activities together to produce a final desirable outcome. ADF is not the only technology available in Azure; Azure also supports Synapse pipelines—an implementation of ADF within Synapse—and open source technologies such as Oozie (available via Azure HDInsight), which can help orchestrate pipelines.
Note
This section primarily focuses on the Create data pipelines concept of the DP-203: Data Engineering on Microsoft Azure exam.
If your workload only uses open-source software, then Oozie will be an alternative solution. However, if the pipeline...