Open source implementations of policy-gradient methods
In this chapter we have covered many algorithms. It is not quite possible to explicitly implement all these algorithms given the space limitations here. We instead relied on RLlib implementations to train agents for our use case. RLlib is open source, so you can go to https://github.com/ray-project/ray/tree/releases/1.0.1/rllib and dive into implementations of these algorithms.
Having said that, RLlib implementations is built for production systems and therefore involve many other implementations regarding error-handling, and preprocessing. In addition, there is a lot of code reuse, resulting in implementations of with multiple class inheritances. A much easier set of implementations is provided by OpenAI's Spinning Up repo at https://github.com/openai/spinningup. I highly recommend you go into that repo and dive into the implementation details of these algorithms we discussed in this chapter.
Info
OpenAI Spinning...