Comparison between GraphFrames and GraphX
A quick comparison between GraphX and GraphFrames is worthwhile because it shows where GraphFrames are heading. Joseph Bradley, a software engineer at Databricks, gave a brilliant talk on GraphFrames and the differences between the two APIs. The talk is available at http://bit.ly/2hBrDwH. Here is a summary of the comparison:
| | GraphFrames | GraphX |
|---|---|---|
| Core APIs | Scala, Java, Python | Scala only |
| Programming abstraction | DataFrames | RDDs |
| Use cases | Algorithms, queries, motif finding | Algorithms |
| Vertex IDs | Any type (in Catalyst) | Long |
| Vertex/edge attributes | Any number of DataFrame columns | Any type (VD, ED) |
| Return types | GraphFrames/DataFrames | Graph[VD, ED] or RDD[Long, VD] |
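One practical consequence of the vertex ID row: a graph keyed by strings (or any other non-Long type) needs its IDs mapped to Long values before it fits the GraphX model. The following is a minimal plain-Python sketch of that idea. It uses in-memory lists in place of DataFrames or RDDs, the airport data and the `index_vertex_ids` helper are hypothetical, and it is not meant to show how either library does this internally.

```python
# Hypothetical string-keyed graph, in the spirit of GraphFrames' "any type" vertex IDs.
vertices = [("SEA", "Seattle"), ("SFO", "San Francisco"), ("JFK", "New York")]
edges = [("SEA", "SFO"), ("SFO", "JFK"), ("JFK", "SEA")]


def index_vertex_ids(vertices, edges):
    """Map arbitrary hashable vertex IDs to consecutive integers (Long-style IDs)."""
    # Assign each original ID its position in the vertex list.
    id_map = {vid: i for i, (vid, _) in enumerate(vertices)}
    # Rewrite vertices and edges in terms of the new integer IDs.
    long_vertices = [(id_map[vid], attr) for vid, attr in vertices]
    long_edges = [(id_map[src], id_map[dst]) for src, dst in edges]
    return id_map, long_vertices, long_edges


id_map, long_vertices, long_edges = index_vertex_ids(vertices, edges)
print(long_edges)
```

Keeping `id_map` around is what lets results computed on the integer-keyed graph be joined back to the original identifiers afterward.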
GraphX <=> GraphFrames
If you have already invested heavily in GraphX and are wondering how to migrate your existing code to GraphFrames, there is good news. Apache Spark provides seamless conversions...