Summary
In this chapter, we learned about graph analytics. We saw how graphs can be built on top of massive datasets, how Apache Spark can be used to build them, and how the GraphFrames library helps in the process. We started with the basics of how graphs are built and represented, and then explored the different forms of analytics that can be run on them: path-based analytics, involving algorithms such as breadth-first search, and connectivity analytics, involving the degrees of connection. Throughout, we used a real-world flight dataset to explore these forms of graph analytics.
Up until now, we have mostly run our data and programs in batch mode. In the next chapter, we will see how big data can also be analyzed in real time.