How constant time complexity O(1) of an operation changed our lives forever
How could the deceptively simple O(1) time complexity class forever change AI and our everyday lives? How could an O(1) operation explain the profound architectural changes that made ChatGPT so powerful and stunned the world? How can something as simple as O(1) allow systems like ChatGPT to spread to every domain and hundreds of tasks?
This section will go a long way toward answering those questions before we move on to see how one token (a small unit of text, often a piece of a word) started an AI revolution that is raging around the world, triggering automation never seen before.
We need to get to the bottom of the chaos and disruption generated by transformers.
To achieve that goal, we will use science and technology to understand how all of this started. First, we will examine O(1), and then we will explore the complexity of a layer in a Python and PyTorch notebook.
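As a quick preview of what constant time means, here is a minimal Python sketch. It is not taken from the notebook. The helper names `constant_time_lookup` and `linear_scan` are hypothetical, and counting a dictionary lookup as a single step is a simplifying average-case assumption, not a measurement. The point is only that the step count of an O(1) operation stays flat as the input grows, while a linear scan grows with it.

```python
def constant_time_lookup(table, key):
    """Look up a key in a dict; modeled as 1 step (assumed average case)."""
    return table[key], 1


def linear_scan(items, key):
    """Search a list of (key, value) pairs, counting every comparison."""
    steps = 0
    for k, v in items:
        steps += 1
        if k == key:
            return v, steps
    raise KeyError(key)


if __name__ == "__main__":
    for n in (10, 1_000, 100_000):
        items = [(i, i * i) for i in range(n)]
        table = dict(items)
        target = n - 1  # worst case for the scan: the last element
        _, scan_steps = linear_scan(items, target)
        _, lookup_steps = constant_time_lookup(table, target)
        print(f"n={n}: linear scan steps={scan_steps}, lookup steps={lookup_steps}")
```

In this sketch, the scan needs as many steps as there are items, whereas the lookup is counted as one step whatever the size of the input.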
Let’s first get the core concepts and terminology...