Questions
- In Q-learning, what does the Q stand for?
Hint: You will have to research this yourself.
- What could we do to limit the number of states that the Q-learning algorithm has to search through?
- What effect does changing the learning rate have on the learning process?
- What function or parameter serves to penalize longer paths in the Q-learning equation? What effect does increasing or decreasing this function have?
- In the genetic algorithm, how would you go about penalizing longer paths so that shorter paths (fewer number of steps) would be preferred?
- Look up the SARSA variation of Q-learning. How would you implement the SARSA technique into program 2.
- What effect does changing the learning rate in the genetic algorithm have? What are the upper and lower bounds of the learning rate?
- In a genetic algorithm, what effect does reducing the population have?