Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Machine Learning with Apache Spark Quick Start Guide Uncover patterns, derive actionable insights, and learn from big data using MLlib

Product type Paperback

Published in Dec 2018

Publisher Packt

ISBN-13 9781789346565

Length 240 pages

Edition 1st Edition

Languages

Java

Tools

Apache Spark

Concepts

Big Data

Author (1):

Jillur Quddus

View More author details

Table of Contents (10) Chapters

Preface

1. The Big Data Ecosystem FREE CHAPTER

2. Setting Up a Local Development Environment

3. Artificial Intelligence and Machine Learning

4. Supervised Learning Using Apache Spark

5. Unsupervised Learning Using Apache Spark

6. Natural Language Processing Using Apache Spark

7. Deep Learning Using Apache Spark

8. Real-Time Machine Learning Using Apache Spark

9. Other Books You May Enjoy

Leave a review - let other readers know what you think

Distributed stream processing engines

Apache Kafka allows us to move real-time data reliably between systems and applications. But we still need some sort of processing engine to process and transform that real-time data in order ultimately to derive value from it based on the use case in question. Fortunately, there are a number of stream processing engines available to allow us to do this, including—but not limited—to the following:

Apache Spark: https://spark.apache.org/
Apache Storm: http://storm.apache.org/
Apache Flink: https://flink.apache.org/
Apache Samza: http://samza.apache.org/
Apache Kafka (via its Streams API): https://kafka.apache.org/documentation/
KSQL: https://www.confluent.io/product/ksql/

Though a detailed comparison of the available stream processing engines is beyond the scope of this book, you are encouraged to explore the preceding links...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at ₹800/month. Cancel anytime

Authors (1)

Quddus

Jillur Quddus is a lead technical architect, polyglot software engineer and data scientist with over 10 years of hands-on experience in architecting and engineering distributed, scalable, high-performance, and secure solutions used to combat serious organized crime, cybercrime, and fraud. Jillur has extensive experience of working within central government, intelligence, law enforcement, and banking, and has worked across the world including in Japan, Singapore, Malaysia, Hong Kong, and New Zealand. Jillur is both the founder of Keisan, a UK-based company specializing in open source distributed technologies and machine learning, and the lead technical architect at Methods, the leading digital transformation partner for the UK public sector.

See other products by Quddus