Last updated on February 17, 2026 by Editorial Team
Author(s): Utkarsh Mittal
Originally published on Towards AI.
Why is guessing the last game?
Pre-training a marginally large language model typically consumes 15 trillion to 30 trillion tokens. This seems like a very large number – until you do the arithmetic on the estimation side. There are approximately 7 to 8 billion people on the planet. If each person sent only one query per day on a model like ChatGPT, and each query consumed about 2,000 tokens when you take into account both the input prompt and the generated output, this alone would amount to about 14 trillion tokens – daily. One day of modest global usage is roughly equivalent to the entire token budget of pre-training. And in fact, heavy users are sending dozens or even hundreds of queries every day. Increase this to 100 questions per person per day, and you need 100× more tokens every day These were first used to train models.

This article highlights the changing economics of artificial intelligence, with an emphasis on the shift from training to inference, which is becoming increasingly complex and expensive with the proliferation of large language models. While companies race to improve their training capabilities, the real expenses arise from guesswork, where models have to process trillions of tokens per day. The article discusses the intricacies of how language models like Llama 3 operate, detailing the steps involved in generating responses, the computational demands at each step, and the challenges of ensuring efficient GPU utilization. Ultimately, understanding this workflow is essential to optimizing AI deployment and effectively managing associated costs.
Read the entire blog for free on Medium.
Published via Towards AI
We build enterprise-grade AI. We will also teach you how to master it.
15 Engineers. 100,000+ students. The AI Academy side teaches what actually avoids production.
Get started for free – no commitments:
→ 6-Day Agent AI Engineering Email Guide – One Practical Lesson Per Day
→ Agents Architecture Cheatsheet – 3 Years of Architecture Decisions in 6 Pages
Our courses:
→ AI Engineering Certification – 90+ lessons from project selection to deployed product. The most comprehensive practical LLM course.
→ Agent Engineering Course – Hands-on with production agent architectures, memory, routing, and eval frameworks – built from real enterprise engagements.
→ AI for Work – Understand, evaluate, and apply AI to complex work tasks.
Comment: The content of the article represents the views of the contributing authors and not those of AI.