Chapter 7. Statistical Data Analysis
In this chapter, we will cover the following topics:
- Exploring a dataset with pandas and Matplotlib
- Getting started with statistical hypothesis testing – a simple z-test
- Getting started with Bayesian methods
- Estimating the correlation between two variables with a contingency table and a chi-squared test
- Fitting a probability distribution to data with the maximum likelihood method
- Estimating a probability distribution nonparametrically with a kernel density estimation
- Fitting a Bayesian model by sampling from a posterior distribution with a Markov chain Monte Carlo method
- Analyzing data with the R programming language in the Jupyter Notebook