In this section, you will learn how to load/import your data and save it. There are many different ways of loading data, and the right way depends on your file type. You can load/import text files, SAS/Stata files, HDF5 files, and many others. HDF (Hierarchical Data Format) is one of the popular data formats which is used to store and organize large amounts of data and it is very useful while working with a multidimensional homogeneous arrays. For example, Pandas library has a very handy class named as HDFStore where you can easily work with HDF5 files. While working on data science projects, you will most likely see many of these types of files, but in this book, we will cover the most popular ones, such as NumPy binary files, text files (.txt), and comma-separated values (.csv) files.
If you have a large dataset in memory and on disk to manage, you can...