Balancing the sampling procedure
We saw a few strategies for improving a global deep learning model by adding new types of features. Now, let’s look at a different aspect that is relevant in a global modeling context. In an earlier section, when we were talking about global deep learning models, we talked about how the process by which we sample a window of sequence to feed to our model can be thought of as a two-step process:
- Sampling a time series out of a set of time series
- Sampling a window out of that time series
Let’s use an analogy to make the concept clearer. Imagine we have a large bowl that we have filled with balls. Each ball in the bowl represents a time series in the dataset (a household in our dataset). Now, each ball, , has chits of paper representing all the different windows of samples we can draw from it.
In the batch sampling we use by default, we open all the balls and dump all the chits into the bowl and discard the balls....