What an LLM is and GPT as an LLM
We’ve used the term LLM a lot in this book. At this point, we need to discover what an LLM is.
At the most fundamental level, an LLM such as GPT is a machine learning model. Machine learning is a subset of AI that enables computers to learn from data. In the case of LLMs, this data is predominantly text – lots and lots of it. Imagine an LLM as a student who has read not just one or two books but millions of them, covering a wide array of topics from history and science to pop culture and memes.
The architecture – neurons and layers
The architecture of an LLM is inspired by the human brain and consists of artificial neurons organized in layers. These layers are interconnected, and each connection has a weight that is adjusted during the learning process. The architecture usually involves multiple layers, often hundreds or even thousands, making it a “deep” neural network. This depth allows the model to learn...