Aggregate versus connected data models
Data can be modeled in Neo4j using both connected and aggregated forms. The connected form has the advantage that new dimensions can be easily extracted from the same model. In comparison, the aggregate form (also can be called as MapReduce) can give us quick access to the aggregated result, which is very useful in particular. In this recipe, you will learn both data models and a third one also, which has been derived by combining both of these data models.
Getting ready
Install the Neo4j graph database on the host machine using the recipe described in Chapter 1, Getting Started with Neo4j. This installation process will depend on your preference for your machine OS type.
How to do it...
Perform the following steps to get started with this recipe:
- In this recipe, we will use Apache log files to describe both connected and aggregated data models. A typical log file has the following structure:
LogFormat "%h %l %u %t \"%r\" %>s %b" common...