- Reinforcement learning: https://en.wikipedia.org/wiki/Reinforcement_learning
- The Bellman equation: https://en.wikipedia.org/wiki/Bellman_equation
- tf.buffer: https://js.tensorflow.org/api/0.11.2/#buffer
- Q-learning: https://en.wikipedia.org/wiki/Q-learning
- CartPole game: https://gym.openai.com/envs/CartPole-v0/





















































