Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Hamburger list icon
Apache Spark This page features the latest and most popular technology books that focus on Apache Spark. Apache Spark is an open source distributed processing framework used for big data analytics and machine learning. It offers performance up to 100x faster than Hadoop MapReduce for certain applications. This page includes descriptions of bestselling books that teach critical skills for data engineers, data scientists, developers and more to leverage Spark for building data pipelines, performing analytics at scale, applying machine learning algorithms and more. The books cover beginner to advanced topics across the Spark ecosystem including Spark SQL, Spark Streaming, GraphX and MLlib.
The Most Popular in Apache Spark
Essential PySpark for Scalable Data Analytics