Modern AI models are a new type of technology: one that can understand and generate human language, create art, and solve complex problems. Why exactly are they so revolutionary? Allow us to explain.
These AI models aren’t just fancy calculators or “if-then” machines. They can process and generate information in ways that mimic human cognition. To top it off, they can do it at a scale that’s hard to picture.
Consider that a publicly available large language model (LLM) can be trained on trillions of tokens (words or pieces of words). Even compressed into a regular zip file, that much text would occupy thousands of gigabytes. Yet the final model might be only tens of gigabytes in size, stored in as little as a single file.
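To make that size gap concrete, here is a rough back-of-the-envelope sketch. Every number in it is an illustrative assumption rather than a figure for any particular model: the token count, the bytes of text per token, the zip compression ratio, the parameter count, and the bytes per parameter are all placeholders chosen to land in the ranges mentioned above.

```python
# Back-of-the-envelope size comparison.
# All constants are illustrative assumptions, not measurements of a real model.
TOKENS = 2_000_000_000_000       # assumed: 2 trillion training tokens
BYTES_PER_TOKEN = 4              # assumed: about 4 bytes of raw text per token
ZIP_RATIO = 3                    # assumed: zip shrinks plain text roughly 3x
PARAMETERS = 7_000_000_000       # assumed: a 7-billion-parameter model
BYTES_PER_PARAMETER = 2          # assumed: weights stored as 16-bit numbers

GB = 1_000_000_000               # decimal gigabyte


def zipped_corpus_gb(tokens=TOKENS, bytes_per_token=BYTES_PER_TOKEN,
                     zip_ratio=ZIP_RATIO):
    """Estimated size of the zipped training text, in gigabytes."""
    return tokens * bytes_per_token / zip_ratio / GB


def model_gb(parameters=PARAMETERS, bytes_per_parameter=BYTES_PER_PARAMETER):
    """Estimated size of the stored model weights, in gigabytes."""
    return parameters * bytes_per_parameter / GB


corpus = zipped_corpus_gb()
model = model_gb()
print(f"Zipped training text: ~{corpus:,.0f} GB")
print(f"Model weights:        ~{model:,.0f} GB")
print(f"The model is roughly {corpus / model:,.0f}x smaller")
```

With these assumed figures, the zipped text comes to roughly 2,700 GB and the model to 14 GB. Change any of the constants and the exact ratio moves, but the gap stays large.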
The model can be so much smaller than its training data because it learns patterns from that data instead of just storing the text to retrieve later. The responses it generates closely resemble the original data, but they are rarely a word-for-word copy.
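The idea of keeping patterns rather than the text itself can be shown with a deliberately tiny stand-in. The sketch below is not how an LLM works internally (real models store numeric weights in a neural network). It is a toy word-pair counter, and the names `train_bigram` and `generate` are made up for this illustration. It only remembers how often one word follows another, then uses those counts to produce new text.

```python
import random
from collections import Counter, defaultdict


def train_bigram(text):
    """Count how often each word is followed by each other word.

    Only these counts are kept; the original word order is discarded.
    """
    words = text.split()
    followers = defaultdict(Counter)
    for current, nxt in zip(words, words[1:]):
        followers[current][nxt] += 1
    return followers


def generate(followers, start, length, seed=0):
    """Produce up to `length` words by sampling from the learned counts."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    word = start
    output = [word]
    for _ in range(length - 1):
        options = followers.get(word)
        if not options:
            break  # this word was never followed by anything in training
        candidates = list(options.keys())
        weights = list(options.values())
        word = rng.choices(candidates, weights=weights, k=1)[0]
        output.append(word)
    return " ".join(output)


# Hypothetical miniature "training data" for the toy model.
corpus = "the cat sat on the mat and the dog sat on the rug"
model = train_bigram(corpus)
print(generate(model, "the", 8))
```

Every pair of neighbouring words in the output appeared somewhere in the training sentence, so the result sounds like the original, yet the sentence as a whole does not have to match it.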