In this chapter, we explored the basic operations and functions of MongoDB. We also prepared a CSV dataset with OpenRefine and turned it into a well-formatted JSON dataset. Finally, we presented an introduction to data processing with the aggregation framework, which is a faster alternative to MapReduce for common aggregations. We introduced the basic operators used in pipelines and the expressions supported by the aggregation framework.
In the next chapter, we will explore the MapReduce functionality of MongoDB and create a word cloud in D3 from the most frequent words in positive tweets.