Smoothing the noise in real-world data
In this recipe, we introduce a few advanced algorithms to help with cleaning the data coming from real-world sources. These algorithms are well known in the signal processing world, and we will not go deep into mathematics but will just exemplify how and why they work and for what purposes they can be used.
Getting ready
Data that comes from different real-life sensors usually is not smooth and clean and contains some noise that we usually don't want to show on diagrams and plots. We want graphs and plots to be clear and to display information and cost viewers minimal efforts to interpret.
We don't need any new software installed because we are going to use some already familiar Python packages: NumPy, SciPy, and matplotlib.
How to do it...
The basic algorithm is based on using the rolling window (for example, convolution). This window rolls over the data and is used to compute the average over that window.
For our discrete data, we use NumPy's
convolve
...