Loading data into a Big Data Cluster
In this recipe, you are going to design an SSIS package that loads data into a database hosted on the remote SQL Server instance.
The SSIS package contains two control flows:
- One extracts key information from the line-of-business database, and loads it into a Staging database hosted in Azure
- The other simulates a process that delivers data from a different source (for instance, managed by another department in your organization, or another organization altogether) and loads it into your Big Data Cluster that is also hosted in Azure
The objective of this SSIS package is to make the necessary data available at the remote location for further processing. The SSIS package that you are going to design in the last recipe of this chapter, Extracting data from a Big Data Cluster, depends on the data that this recipe's package makes available.
Getting ready
We assume that you have already cloned the cookbook...