Here is what a simple neural network with loops looks like:

In this diagram, a neural network $N$ takes input $x_t$ to produce output $y_t$. Due to the loop, at the next time step $t+1$, it takes the input $y_t$ along with the input $x_{t+1}$ to produce the output $y_{t+1}$. Mathematically, we represent this as the following equation:

$$y_{t+1} = \phi(x_{t+1}, y_t)$$
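To make the recurrence concrete, here is a minimal NumPy sketch of this looped computation. The choice of $\tanh$ as $\phi$, the array sizes, and the parameter names are illustrative assumptions, not from the original text:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative parameters (assumed sizes: 3-dim inputs, 4-dim outputs).
w = rng.normal(size=(4, 3 + 4))
b = np.zeros(4)

def phi(x_t, y_prev):
    # One recurrent step: combine the new input with the previous output.
    # tanh is an assumed choice for the function phi.
    return np.tanh(w @ np.concatenate([x_t, y_prev]) + b)

y = np.zeros(4)                      # initial output y_0
for x_t in rng.normal(size=(5, 3)):  # a sequence of five inputs x_1 .. x_5
    y = phi(x_t, y)                  # the loop feeds y_t back into the next step
```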
When we unroll the loop, the RNN architecture looks as follows at time step 1:

As the time steps evolve, this loop unrolls as follows at time step 5:

At every time step, the same learning function, $\phi(\cdot)$, and the same parameters, $w$ and $b$, are used.
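The unrolled view can be written out by hand, as in the sketch below; the five explicit lines make the parameter sharing visible, since each applies the same function `phi` with the same `w` and `b` (all names and sizes are again illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)
w, b = rng.normal(size=(4, 7)), np.zeros(4)  # one set of parameters w, b

def phi(x_t, y_prev):
    return np.tanh(w @ np.concatenate([x_t, y_prev]) + b)

x = rng.normal(size=(5, 3))  # inputs x_1 .. x_5
y0 = np.zeros(4)             # initial output

# Unrolled to time step 5: every line reuses the same phi, w, and b.
y1 = phi(x[0], y0)
y2 = phi(x[1], y1)
y3 = phi(x[2], y2)
y4 = phi(x[3], y3)
y5 = phi(x[4], y4)
```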
The output y is not always produced at every time step. Instead, an output h is produced at every time step, and another activation function is applied to this h to produce the output y. The equations for the RNN now look like this:

$$h_t = \phi_h(w_{hx} \cdot x_t + w_{hh} \cdot h_{t-1} + b_h)$$

$$y_t = \phi_y(w_{yh} \cdot h_t + b_y)$$

where:
- $x_t$ is the input at time step $t$
- $h_t$ is the hidden-layer output at time step $t$, with $h_0$ as the initial state
- $y_t$ is the final output at time step $t$
- $w_{hx}$, $w_{hh}$, and $w_{yh}$ are the weights and $b_h$ and $b_y$ are the biases, shared across all time steps
- $\phi_h$ and $\phi_y$ are the activation functions for the hidden and output layers (commonly $\tanh$ and softmax, respectively)
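As an illustration, these two equations can be sketched as a forward pass in NumPy. The dimensions, the $\tanh$ and softmax choices for $\phi_h$ and $\phi_y$, and the variable names are assumptions made for the example:

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hid, n_out = 3, 4, 2  # assumed layer sizes

# Parameters, created once and shared across all time steps.
w_hx = rng.normal(size=(n_hid, n_in))   # input  -> hidden
w_hh = rng.normal(size=(n_hid, n_hid))  # hidden -> hidden (the loop)
w_yh = rng.normal(size=(n_out, n_hid))  # hidden -> output
b_h, b_y = np.zeros(n_hid), np.zeros(n_out)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

h = np.zeros(n_hid)                     # h_0
for x_t in rng.normal(size=(5, n_in)):  # inputs x_1 .. x_5
    h = np.tanh(w_hx @ x_t + w_hh @ h + b_h)  # h_t = phi_h(...)
    y = softmax(w_yh @ h + b_y)               # y_t = phi_y(...)
```

Note that `w_hx`, `w_hh`, `w_yh`, `b_h`, and `b_y` are defined once and reused at every iteration, which matches the parameter sharing described above.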