Quantile Regression DQN
In this section, we will look into another interesting distributional RL algorithm called QR-DQN. It is a distributional DQN algorithm similar to the categorical DQN; however, it has several features that make it more advantageous than a categorical DQN.
Math essentials
Before going ahead, let's recap two important concepts that we use in QR-DQN:
- Quantile
- Inverse cumulative distribution function (Inverse CDF)
Quantile
When we divide our distribution into equal areas of probability, they are called quantiles. For instance, as Figure 14.18 shows, we have divided our distribution into two equal areas of probabilities and we have two quantiles with 50% probability each:
Figure 14.18: 2-quantile plot
Inverse CDF (quantile function)
To understand an inverse cumulative distribution function (inverse CDF), first, let's learn what a cumulative distribution function (CDF) is.
Consider a random variable...