Summary
In this chapter, we have covered an emerging paradigm in artificial intelligence, machine teaching, which is about effectively conveying the expertise of a subject matter expert (teacher) to machine learning model training. We discussed how this is similar to how humans are educated: By building on others' knowledge. The advantage of this approach is that it greatly increases data efficiency in machine learning, and, in some cases, makes learning possible that would have been impossible without a teacher. We discussed various methods in this paradigm, including reward function engineering, curriculum learning, demonstration learning, action masking, and concept networks. We observed how some of these methods improved vanilla use of Ape-X DQN significantly.
Besides its benefits, machine teaching also has some challenges and potential downsides: First, it is usually non-trivial to come up with good reward shaping, curriculum, set of action masking conditions etc. This...