Chapter 7. Apache Spark GraphX
In this chapter, we want to examine the Apache Spark GraphX module and graph processing, in general. So, this chapter will cover the topic of implementing graph analysis workflows on top of GraphX. The GraphX coding section, written in Scala, will provide a series of graph coding examples. Before writing code in Scala to use the Spark GraphX module, we think it will be useful to provide an overview of what a graph actually is in terms of graph processing. The following section provides a brief introduction using a couple of simple graphs as examples.
In this chapter we will cover:
- Creating a graph from raw data
- Counting
- Filtering
- PageRank
- Triangle count
- Connected components