Exploring the data
Let's explore a little and try to get a feel for the data. First, let's try to get some summary statistics for the various datasets. Afterward, we'll generate some graphs to get a more intuitive sense for what's in the data and how they're related.
Generating summary statistics
Incanter makes generating summary statistics easy. You can pass a dataset to the incanter.stats/summary
function. It returns a sequence of maps. Each map represents the summary data for each column in the original dataset. This includes whether the data is numeric or not. For nominal data, it returns some sample items and their counts. For numeric data, it returns the mean, median, minimum, and maximum.
Summarizing UNODC crime data
If we load the data and filter it for the crime of "burglary", we can get the summary statistics for those fields as follows:
(s/summary (i/$where {:crime {:$eq "CTS 2012 Burglary"}} by-ag-lnd))
And if we pick apart the data structures that it outputs, the following are the...