Big data is the topic on everyone's mind. I thought it would be good to see what can be done with big data in Jupyter. One up-and-coming language for dealing with large datasets is Spark. Spark is an open source toolset. We can use Spark coding in Jupyter much like the other languages we have seen.
In this chapter, we will cover the following topics:
- Installing Spark for use in Jupyter
- Using Spark's features