In previous chapters we used the Kafka Java API for producers and consumers, Kafka Streams, and KSQL. In this chapter we instead connect Kafka with Spark Structured Streaming, Apache Spark's solution for processing streams with its Datasets API.
This chapter covers the following topics:
- Spark Streaming processor
- Reading Kafka from Spark
- Data conversion
- Data processing
- Writing to Kafka from Spark
- Running the SparkProcessor