Policy experiments
The first model that I trained was with the Height
objective and without zeroing the yaw component. A video of the robot executing the policy is available here: https://www.youtube.com/watch?v=u5rDogVYs9E. The movements are not very natural. In particular, the front-right leg is not moving at all. This model is available in the source tree as Chapter18/hw/libhw/t1.py
.
As this might be related to the yaw observation component, which is different during the training and inference, the model was retrained with the --zero-yaw
command-line option. The result is a bit better: all legs are now moving, but the robot's actions are still not very stable. The video is here: https://www.youtube.com/watch?v=1JVVnWNRi9k. The model used is in Chapter18/hw/libhw/t1zyh.py
.
The third experiment was done with a different training objective, HeightOrient
, which not only takes into account the height of the model, but also checks that the body of the robot is parallel to the...