Using PolyBase to Load Data to a SQL Pool
PolyBase is a tool that enables services such as SQL Server and Synapse dedicated SQL pools to copy and query data directly from external locations, including sources such as Azure Storage, Oracle, Teradata, Hadoop, and MongoDB. PolyBase is integrated into T-SQL, so every time you use a COPY INTO <table> FROM command to read data from an external storage location, PolyBase kicks in. It is one of the fastest and most scalable ways to copy data.
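To make the shape of the COPY INTO command concrete, the following is a minimal sketch in Python that only assembles the T-SQL text and prints it; it does not connect to a SQL pool. The helper name build_copy_into, the table dbo.TripsStaging, the placeholder storage URL, and the choice of managed identity for the credential are all assumptions made for illustration, not values from this scenario.

```python
def build_copy_into(table, source_url, file_type="CSV",
                    first_row=2, field_terminator=","):
    """Return a T-SQL COPY INTO statement as text. Nothing is executed.

    Hypothetical helper: inputs are interpolated as-is, so pass only
    trusted values.
    """
    options = [f"FILE_TYPE = '{file_type}'"]
    if file_type == "CSV":
        # These two options apply to delimited text files only.
        options.append(f"FIELDTERMINATOR = '{field_terminator}'")
        options.append(f"FIRSTROW = {first_row}")  # 2 skips a header row
    # Assumption: the pool's managed identity can read the storage account.
    options.append("CREDENTIAL = (IDENTITY = 'Managed Identity')")
    body = ",\n    ".join(options)
    return (
        f"COPY INTO {table}\n"
        f"FROM '{source_url}'\n"
        f"WITH (\n    {body}\n);"
    )


# Placeholder table name and staging location (assumptions).
statement = build_copy_into(
    table="dbo.TripsStaging",
    source_url="https://<storage-account>.dfs.core.windows.net/<container>/transformed/*.csv",
)
print(statement)
```

The printed statement could then be run against the dedicated SQL pool from a query editor, after replacing the placeholders with a real table and storage path.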
Note
This section primarily focuses on the "Use PolyBase to load data to a SQL pool" concept of the DP-203: Data Engineering on Microsoft Azure exam.
For the data lake scenario, you are going to use PolyBase to copy the transformed data from ADB into a Synapse dedicated SQL pool, using ADLS or Blob Storage as a staging area. Perform the following steps to do so:
- Prepare the source data in text files in ADLS or Blob Storage.
- Define an external table with the right schema in the dedicated...