This chapter will explain what SQL isand why it is important for data analysis by teaching you how to use and access databases using SQLite for our examples. An overview of relational database technology will be provided along with insightful information on database systems to help to improve your data literacy when communicating with experts. You will also learn how to run SQLSELECTqueries from the Jupyter Notebook and how to load them into DataFrames. Basic statistics, data lineage, and metadata (data about data) will be explained using thepandaslibrary.
We will cover the following topics in this chapter:
- Introduction to SQL and relational databases
- From SQL to pandas DataFrames
- Data about your data explained
- The importance of data lineage