This section introduces model-free RL algorithms, covering both value-based methods and policy gradient methods. You will also implement several state-of-the-art algorithms.
This section includes the following chapters:
- Chapter 4, Q-Learning and SARSA Applications
- Chapter 5, Deep Q-Network
- Chapter 6, Learning Stochastic and PG Optimization
- Chapter 7, TRPO and PPO Implementation
- Chapter 8, DDPG and TD3 Applications