For more detail some of the architectures that we looked at in this chapter, I would suggest reading the following papers:
- Model-agnostic meta-learning: https://arxiv.org/pdf/1703.03400.pdf
- Optimization as a model for few-shot learning: https://openreview.net/pdf?id=rJY0-Kcll