The past year in AI has been defined by rapid shifts in capability, adoption, and developer behavior. Breakthrough reasoning models, the acceleration of open-source innovation, and a surge in AI-native applications have reshaped how we build and interact with intelligent systems. Today, we are releasing a new empirical study that examines these changes through a particularly comprehensive lens: more than 100 trillion tokens of real-world LLM usage from OpenRouter.
OpenRouter now serves over 5+ million developers and routes traffic across 300+ models from more than 60 providers. The platform has grown from handling roughly 10 trillion tokens per year to more than 100 trillion as of mid-2025. Just last week, OpenRouter processed more than 1 trillion tokens every single day. For perspective, OpenAI’s entire API averaged about 8.6 trillion tokens per day in October. This scale gives OpenRouter a particularly comprehensive view into how developers use AI across industries, geographies, and model families.
Our goal with this study is simple: provide the largest scale empirical picture yet of what people do with AI today, and what that tells us about the next chapter of the industry.
A year ago, most models excelled at surface-level pattern prediction. They produced fluent text, but struggled with multi-step reasoning and planning. That changed on December 5, 2024 with the release of OpenAI’s first full-fledged reasoning model, o1 (Strawberry). It marked the beginning of a new phase where models do not just answer questions but break down problems, search for information, and evaluate paths.
Our data shows this shift playing out in practice. The fastest-growing behavior on OpenRouter is what we call agentic inference. Developers are increasingly building workflows where the model acts in extended sequences rather than single prompts. A typical interaction no longer stops at a response. Instead, the model plans, retrieves context from tools or APIs, revises outputs, and iterates until the task is complete. Prompt lengths are increasing. Sessions have more turns. Specialized reasoning and tool-use models are gaining share.
The implication is profound. AI is moving from a static chat interface to an active participant in work. The competitive frontier is no longer only about accuracy or benchmarks. It is about orchestration, control, and a model’s ability to operate as a reliable agent. For founders, this represents a strategic opening. Products that embrace these workflows early will define the next generation of AI-native applications.
Because OpenRouter aggregates open and closed models at scale, the data tells a richer story of competition than benchmark leaderboards alone.
Several patterns stand out:
These insights are difficult to see from traditional benchmarks. They emerge only when examining large-scale, real-world interactions across models and workloads.
The shift toward agentic behavior and multi-step workflows is not a theoretical prediction. It is already visible in production traffic. As the industry matures, the winners will be those who capitalize on this shift by building for reasoning, tool use, persistence, and long-horizon tasks.
For researchers, the dataset surfaces fresh questions. Why is roleplay so dominant across models? What patterns in tool-augmented usage hint at future architectures? What retention curves can predict the next breakout capability? Real-world data at this scale has been missing from the field. This study begins to shade some lights on these questions.
The State of AI: An Empirical 100 Trillion Token Study with OpenRouter provides a deeper analysis of these trends, including open vs closed source model usages, geographic patterns, category taxonomies, and cohort-based insights into long-term engagement.
You can access the full report here on OpenRouter. It is a comprehensive resource for anyone building, researching, or investing in AI. Understanding how 100 trillion tokens are used in the real world offers a data-driven guide to the future of the ecosystem.
This is just the beginning of a richer conversation grounded in real world usage. We invite you to dive in.