Further reading
For more information about the policy gradient, we can refer to the following paper:
- Policy Gradient Methods for Reinforcement Learning with Function Approximation by Richard S. Sutton et al., https://papers.nips.cc/paper/1713-policy-gradient-methods-for-reinforcement-learning-with-function-approximation.pdf