Technical requirements
This chapter will return to the Java Software Development Kit (SDK), so the tools will be the same as those in Chapter 4, Structuring Code for Reusability. As always, we will need a cloned version of the GitHub repository for this book located at https://github.com/PacktPublishing/Building-Big-Data-Pipelines-with-Apache-Beam.
So, let's dive directly into the (admittedly non-trivial) concept of splittable DoFn!