In the previous chapter, we learned about DataFrames and Datasets and how to use or write custom encoders for type-safe operations on Datasets. This chapter explains the SQL component of Spark, which lets developers who work with Hive or are familiar with RDBMS SQL query data in Spark using a similar style.
We will cover the following topics in this chapter:
- Spark metastore
- SQL language manual
- SQL databases using Java Database Connectivity (JDBC)