The pandas package was created by Wes McKinney in 2008 as a result of frustrations he encountered while working on time series data in R. It is built on top of NumPy and provides features not available in it. It provides fast, easy-to-understand data structures and helps fill the gap between Python and a language like R. NumPy deals with homogeneous blocks of data. Using pandas helps to deal with data in a tabular structure composed of different data types.
The official documentation for pandas can be found at http://pandas.pydata.org/pandas-docs/stable/dsintro.html.
There are three main data structures in pandas:
- Series—1D
- DataFrame—2D
- Panel—3D