Tuning queries by using indexers
Indexing is another common optimization technology used in database systems, data warehouses, and analytical engines such as Spark. Let's look at the indexing options and tuning guidelines for both Synapse SQL and Spark.
Indexing in Synapse SQL
If you remember, we learned about the different types of indexing in Chapter 5, Implementing Physical Data Storage Structures, in the Implementing different table geometries with Azure Synapse Analytics pools section. I'll recap the different types of indexers we have along with tips for tuning Synapse SQL here again.
There are three primary types of indexing available in Synapse SQL:
- Clustered Columnstore Index: This is the default index option for Synapse SQL. If you don't specify any indexing options, the table will automatically get indexed using this method. Use this index for large tables > 100 million rows. It provides very high levels of data compression and good overall...