Spark connectivity to BI tools
In the era of big data and artificial intelligence (AI), Hadoop and Spark have modernized data warehouses into distributed warehouses that can process up to petabytes (PB) of data. Thus, BI tools have also evolved to utilize Hadoop- and Spark-based analytical stores as their data sources, connecting to them using JDBC/ODBC. BI tools ranging from Tableau, Looker, Sisense, MicroStrategy, Domo, and so on all feature connectivity support and built-in drivers to Apache Hive and Spark SQL. In this section, we will explore how you can connect a BI tool such as Tableau Online with Databricks Community Edition, via a JDBC connection.
Tableau Online is a BI platform fully hosted in the cloud that lets you perform data analytics, publish reports and dashboards, and create interactive visualizations, all from a web browser. The following steps describe the process of connecting Tableau Online with Databricks Community Edition:
- If you already have an existing...