Further Reading
Fine-Tuning Transformers: Vocabulary Transfer, Mosin et al. (2021): https://arxiv.org/abs/2112.14569
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers, Tay et al. (2022): https://arxiv.org/abs/2109.10686