Standard Julia capabilities
As in other data analysis languages, Julia can compute most of the basic summary statistics on your data with the describe function, as in the example script that follows:
using RDatasets
describe(dataset("datasets", "iris"))
This script loads the Iris dataset and displays summary statistics for it.
If we were to build a notebook to show describe in use against the iris dataset (loaded in the previous example), we would end up with a display like this:
You can see the standard statistics generated for each of the variables in the dataset. I thought it was interesting that the count and percentage of NA values are reported for each variable. In other languages, I have found that I usually have to check for this data separately and remember to exclude it. Here, it is a quick, built-in reminder.
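To see the missing-value count in isolation, here is a minimal sketch on a small made-up table rather than the iris data. It assumes a recent DataFrames.jl release, where absent values are written as missing (not NA) and describe accepts the names of the statistics to report; the column names and values are hypothetical.

```julia
using DataFrames

# Hypothetical toy data; `missing` marks an absent value.
df = DataFrame(
    sepal_length = [5.1, 4.9, missing, 4.6],
    species = ["setosa", "setosa", "virginica", missing],
)

# Ask describe for selected statistics only:
# the mean and the number of missing values per column.
stats = describe(df, :mean, :nmissing)

println(stats)
```

The result is itself a data frame with one row per variable, so the missing counts can be read programmatically from stats.nmissing. The mean is computed over the non-missing values, and it is left empty for the non-numeric species column.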