Overall results
To simplify the comparison of the methods, I put all the numbers related to the best rewards obtained in the following table:
Method
|
HalfCheetah
|
Ant
|
||
PyBullet | MuJoCo | PyBullet | MuJoCo | |
A2C | 2,189 | 4,718 | 2,425 | 5,380 |
PPO | 2,567 | 1,623 | 2,560 | 5,108 |