Understanding the encoder-decoder paradigm
In Chapter 5, Time Series Forecasting as Regression, we saw that machine learning is all about learning a function that maps our inputs to the desired output:
y = h(x)
where x is the input and y is our desired output.
Adapting this to time series forecasting (using univariate time series forecasting to keep things simple), we can rewrite it as follows:
yt = h(yt-1, yt-2, …, yt-N)
Here, t is the current timestep and N is the total amount of history available at time t.
Deep learning, like any other machine learning approach, is tasked with learning this function, which maps history to the future. In Chapter 11, Introduction to Deep Learning, we saw how deep learning learns good features using representation learning, and then it uses the learned features to carry out the task at hand. This understanding can be further refined to the time series perspective by using the encoder-decoder paradigm.
Like everything...