Questions
The questions are as follows:
- What is the difference between a DQN and a dueling DQN?
- Write the Python code for a replay buffer.
- What is a target network?
- Write the Python code for a prioritized experience replay buffer.
- Create a Python function that decays epsilon for an epsilon-greedy policy.
- How does a dueling DQN differ from a double DQN?
- Create a Python function that copies the primary network's weights to the target network.