Getting started with pandas
The pandas
library is a great library for data manipulation and analysis, written by Wes McKinney. The pandas
library provides us with the optimized data structures, series and DataFrame, which are well-suited for descriptive statistics, indexing, and aggregation. Pandas is already installed in the Anaconda distribution used in Wakari. In this section, we will present the basic operations with pandas for Time Series and Multivariate data. We can find more information about pandas from its website:
Working with Time Series
Time Series help us to understand the changes in a variable over time. Pandas include specific functionality in order to work with Time Series transparently. For this section, we need to upload the Gold.csv
file used in Chapter 7, Predicting Gold Prices. The first five rows in the file will look like this:
date,price
1/31/2003,367.5
2/28/2003,347.5
3/31/2003,334.9
4/30/2003,336.8
5/30/2003,361.4
. . .
We will load the...