Chapter 13
The first aspect to consider for a data analytics solution is where data will be imported from. The sales data is already being stored in a database that can be interrogated directly. The marketing data is being stored as CSV files, and therefore ADLS Gen2 would make an excellent choice for holding these files.
Next, Azure Data Factory can ingest and combine the data into a single output, which can then be stored back in the ADLS.
Finally, Azure Databricks could be the best option for modeling and analyzing the data using optimized Spark clusters. Azure Databricks also supports the latest version of Spark.
The following diagram shows what this might look like:
Example data pipeline using multiple technologies