Search icon CANCEL
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Apache Spark 2.x for Java Developers

You're reading from  Apache Spark 2.x for Java Developers

Product type Book
Published in Jul 2017
Publisher Packt
ISBN-13 9781787126497
Pages 350 pages
Edition 1st Edition
Languages
Authors (2):
Sourav Gulati Sourav Gulati
Profile icon Sourav Gulati
Sumit Kumar Sumit Kumar
Profile icon Sumit Kumar
View More author details

Table of Contents (19) Chapters

Title Page
Credits
Foreword
About the Authors
About the Reviewer
www.PacktPub.com
Customer Feedback
Preface
1. Introduction to Spark 2. Revisiting Java 3. Let Us Spark 4. Understanding the Spark Programming Model 5. Working with Data and Storage 6. Spark on Cluster 7. Spark Programming Model - Advanced 8. Working with Spark SQL 9. Near Real-Time Processing with Spark Streaming 10. Machine Learning Analytics with Spark MLlib 11. Learning Spark GraphX

Introduction to Property Graph


It is the basic abstraction of the Graphx API. Property is a directed multi-graph where every vertex and edge is associated with a property. Each vertex in the Property Graph is also associated with a unique 64-bit long identifier (VertexId). A directed multi-graph is defined as a directed graph where there can be multiple edges (relationships) between the same vertices, such as A can be a friend and team mate of B.

The following is a logical representation of a Property Graph:

Logical representation of Property Graph

Here, we have a Property Graph consisting of five vertices. Each vertex in the graph consists of a VertexId and a property, which is a string object in this case, and every edge is also associated with a property, which is a string object as well, which describes the relation between the vertices.

Spark stores vertices and edges in different RDDs as follows:

Storage representation of Property Graph

Every element of the RDD of vertices contains a VertexId...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at €14.99/month. Cancel anytime}