Understanding DataFrames
A DataFrame is a data structure that has labeled columns, which may individually have different data types. Like a SQL table or a spreadsheet, it has two dimensions. It can also be thought of as a list of dictionaries, but fundamentally, it is different.
DataFrames are the recommended data structure for statistical analysis. Julia provides a package called DataFrames.Jl
, which has all the necessary functions to work with DataFrames.
Julia's package, DataFrames, provides three data types:
NA
: A missing value in Julia is represented by a specific data type,NA
.DataArray
: The array type defined in the standard Julia library, though it has many features, doesn't provide any specific functionalities for data analysis. TheDataArray
type provided inDataFrames.jl
provides such features (for example if we needed to store some missing values in the array).DataFrame
: This is a two-dimensional data structure, such as spreadsheets. It is much like R or Pandas DataFrames and provides...