7. Introduction to Analytics Engine (Spark) for Big Data
Overview
This chapter will help you learn the fundamentals of Apache Spark. By combining a sequence of transformations and actions, you will be able to create a pipeline in Spark and run it. We will be using Databricks to launch and use a Spark cluster. By the end of this chapter, you should be comfortable with creating and running a Spark pipeline using a Databricks notebook on a Spark cluster.