Try answering the following questions:
- What is the difference between an RNN and a feedforward neural network?
- How is the hidden state computed in an RNN?
- What is the use of a recurrent network?
- How does the vanishing gradient problem occur?
- What is the exploding gradient problem?
- How gradient clipping mitigates the exploding gradient problem?
- What are the different types of RNN architectures?