Questions
Let's evaluate our newly acquired knowledge by answering the following questions:
- How does TD learning differ from the MC method?
- What is the advantage of using the TD learning method?
- What is TD error?
- What is the update rule of TD learning?
- How does the TD prediction method work?
- What is SARSA?
- How does Q learning differ from SARSA?