Chapter 1. Big Data Analytics with Spark
In this chapter, we will cover the components of Spark through the following recipes:
- Initializing SparkContext
- Working with Spark's Python and Scala shells
- Building standalone applications
- Working with the Spark programming model
- Working with pair RDDs
- Persisting RDDs
- Loading and saving data
- Creating broadcast variables and accumulators
- Submitting applications to a cluster
- Working with DataFrames
- Working with Spark Streaming