What this book covers
Chapter 1, Getting Started with Breeze, serves as an introduction to the Breeze linear algebra library's API.
Chapter 2, Getting Started with Apache Spark DataFrames, introduces powerful, yet intuitive and relational-table-like, data abstraction.
Chapter 3, Loading and Preparing Data – DataFrame, showcases the loading of datasets into Spark DataFrames from a variety of sources, while also introducing the Parquet serialization format.
Chapter 4, Data Visualization, introduces Apache Zeppelin for interactive data visualization using Spark SQL and Spark UDF functions. We also briefly discuss Bokeh-Scala, which is a Scala port of Bokeh (a highly customizable visualization library).
Chapter 5, Learning from Data, focuses on machine learning using Spark MLlib.
Chapter 6, Scaling Up, walks through various deployment alternatives for Spark applications: standalone, YARN, and Mesos.
Chapter 7, Going Further, briefly introduces Spark Streaming and GraphX.