Chapter 9: Playing Doom with a Deep Recurrent Q Network
- DRQN makes use of recurrent neural network (RNN) where DQN makes use of vanilla neural network.
- DQN is not used applied when the MDP is partially observable.
- Refer section Doom with DRQN.
- DARQN makes use of attention mechanism unlike DRQN.
- DARQN is used to understand and focus on particular area of game screen which is more important.
- Soft and hard attention.
- We set living reward to 0 which the agent does for each move, even though the move is not useful.