An Introduction to Data Analysis in Python Polars
Data analysis is a broad term that encompasses various steps of inspecting, transforming, and understanding data in order to uncover valuable insights. This chapter focuses on teaching you the fundamentals of data analysis in Python Polars while exploring the dataset. You’ll learn how to inspect your data, generate its summary statistics, adjust data types to suit your needs, and clean the data for further analysis.
In this chapter, we’re going to cover the following main topics:
- Inspecting the DataFrame
- Casting data types
- Handling duplicate values
- Masking sensitive data
- Visualizing data using Plotly
- Detecting and handling outliers