References
Please go through the following content for more information on a few of the topics covered in this chapter:
- Papers With Code: https://paperswithcode.com/datasets.
- Hugging Face Hub: https://huggingface.co/datasets
- Hugging Face: https://huggingface.co/datasets?task_ids=task_ids:language-modeling&sort=downloads
- AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE: https://arxiv.org/pdf/2010.11929.pdf
- Scaling Laws for Neural Language Models: https://arxiv.org/pdf/2001.08361.pdf
- Training Compute-Optimal Large Language Models: https://arxiv.org/pdf/2203.15556.pdf
- BigScience Episode #5 – Challenges & Perspectives in Creating Large Language Models: https://bigscience.huggingface.co/acl-2022