Further reading
For more information, we can refer to the following papers:
- Playing Atari with Deep Reinforcement Learning by Volodymyr Mnih, et al., https://arxiv.org/pdf/1312.5602.pdf
- Deep Reinforcement Learning with Double Q-learning by Hado van Hasselt, Arthur Guez, David Silver, https://arxiv.org/pdf/1509.06461.pdf
- Prioritized Experience Replay by Tom Schaul, John Quan, Ioannis Antonoglou and David Silver, https://arxiv.org/pdf/1511.05952.pdf
- Dueling Network Architectures for Deep Reinforcement Learning by Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas, https://arxiv.org/pdf/1511.06581.pdf
- Deep Recurrent Q-Learning for Partially Observable MDPs by Matthew Hausknecht and Peter Stone, https://arxiv.org/pdf/1507.06527.pdf