Chapter 4: Azure Data Integration
This chapter will teach you recipes for getting SSIS to interact with Azure services. Microsoft has created tools to enable SSIS to read, write, and call Apache Hive and Pig scripts in Azure. Performing all these tasks from an SSIS on-premises ETL server is not ideal in terms of security. You need to open multiple ports on the ETL server to access cloud services. A subsequent chapter will show you how to call those services from a cloud ETL tool known as Azure Data Factory.
This chapter will cover the following recipes:
- Setting up the Azure Feature Pack in SSIS
- Setting up a Java Runtime Environment
- Setting up an Azure storage account
- Creating an on-demand Azure HDInsight cluster
- Transforming data with Hive
- Transforming data with Pig
Let's get started!