Summary
Over the course of this chapter, we have learned how to manipulate data in Spark DataFrames.
We talked about the Spark DataFrame API and what different data types are in Spark. We also learned how to create DataFrames in Spark and how we can view these DataFrames once they’ve been created. Finally, we learned about different data manipulation and data aggregation functions.
In the next chapter, we will cover some advanced operations in Spark with respect to data manipulation.