Infra

The State of Generative Media 2026

Jennifer Li and Justine Moore Posted February 19, 2026

Today, our friends at fal released the State of Generative Media Report, a deep-dive into the remarkable progress we’re seeing in generative media. fal has a particularly privileged vantage point: their inference engine serves over 600 models to millions of hobbyists, developers, and enterprises who collectively generate billions of assets. Naturally, we were eager to dig in.

Here are our top takeaways.

1) No one model to rule them all

The world of image and video models is remarkably fragmented — and intentionally so. fal’s report finds that enterprise production deployments use a median of 14 different models. This is a striking contrast to the LLM landscape, where a handful of major labs dominate. a16z’s growth strategy recently put out a report that illustrates just how concentrated that market is: OpenAI, Gemini, and Anthropic together command 89% of enterprise wallet share. In generative media, we see nothing like that.

This makes sense, and is actually something we’ve written about before. Even if a model is extraordinary at photorealistic images, or excels at anime aesthetics, or has strong physics simulation, that doesn’t mean you should use it for background removal, sound generation, or multi-shot narrative scenes. Each model tends to be strong in some areas and weaker in others.

The job of infrastructure (and a team like fal) expands accordingly: it’s not just about serving requests efficiently, but about supporting the rapid pace of new releases — rolling out new models every few weeks and providing day-0 support as the field moves faster than enterprise software typically does.

2) From inference to orchestration

Another reason why there are dozens of models used simultaneously is producing a single polished asset is rarely a single inference call. In practice, developers chain multiple models together: generate an image, remove the background, upscale it, recolor it, apply a style-consistent LoRA – to achieve the brand-level consistency and quality that one-shot prompting simply can’t deliver. The unit of work isn’t one model, it’s a workflow.

This has real implications for infrastructure: it’s not enough to serve individual models quickly. You need to orchestrate multi-step pipelines with low cumulative latency, manage dependencies between steps, and make it easy to swap in new models as the frontier moves. As we get into the territory of long form videos, this pipeline is evolving even more rapidly. Producing even a short branded film means chaining together scene generation, camera motion control, character persistence across cuts, dialogue synthesis, sound design, and post-production effects, with each step relying on a specialized model and the output of the step before it.

This is where developer tooling becomes make-or-break. If every model in the pipeline has a different API shape, auth, error handling, and async behavior, your team spends more time on plumbing than product. You need a unified interface across models, workflow primitives to chain steps into a single callable pipeline, streaming for intermediate results, and queue management for long-running jobs. The orchestration layer matters as much as the models themselves, and it’s a big part of why infrastructure like fal’s is so critical to take models from prototyping to production.

3) Not all pixels are worth the same

One thing we’ve observed is that builders have gotten remarkably savvy about model selection. The key insight: the right model depends on what you’re generating and at what scale.

If you’re producing huge volumes of small, utilitarian images — think product thumbnails or feed assets — you bias toward models that are fast and cheap, because the marginal value of perfection is low and the marginal cost compounds fast. Models like Flux are a natural fit here. Conversely, when you’re generating hero assets where polish is the priority – ad campaigns, logos, brand imagery – it makes sense to pay for something like Nano Banana Pro, because small imperfections will look unprofessional at that level of scrutiny.

But the cost calculus doesn’t stop at the model layer. Infrastructure matters too. In fal’s survey with Artificial Analysis, 58% of organizations identified cost optimization as their primary criterion when selecting model infrastructure, ahead of factors like model availability and generation speed. Competition is happening at two layers simultaneously: between infra providers racing to offer the most cost-effective run of a given model, and between models along the cost-quality frontier where the right choice depends on traffic scale and tolerance for imperfection.

4) Adoption is everywhere (but some industries are moving faster than others)

Generative media isn’t a niche anymore. Adoption is showing up across virtually every industry, from creative software to retail and commerce. Three verticals stand out: gaming, advertising, and e-commerce.

In gaming, studios are using generative models to prototype concept art, populate environments, and produce in-game assets at a pace traditional pipelines can’t touch. In advertising, the shift is dramatic – campaigns that once took weeks of production now spin up hundreds of personalized variations in hours. It’s changing the economics of creative testing and spawning entirely new startup categories. And in e-commerce, the case almost makes itself: when you need product shots, lifestyle imagery, and seasonal creative across thousands of SKUs, generative media turns what used to require a team of photographers, weeks of shoots, and long editing cycles into a few prompts and a library of production-ready assets.

5) What comes next?

2026 looks like another packed year for generative media. A few things we’re watching closely:

More capable and coherent video. Seedance 1.0 topped leaderboards in June 2025, and the previews of Seedance 2.0 are blowing us away. Other video model providers like Kling and Grok aren’t far behind – and we expect there’s more to come from Sora at OpenAI and Veo at DeepMind. The next generation of models will continue to push on multi-shot narrative consistency, character persistence across scenes, and controllability. The model releases in 2025 came every 4–6 weeks; there’s no reason to expect that pace to slow down.

World models going from prototype to product. This is arguably the most exciting frontier. In late 2025, Marble (from World Labs) showed that it’s now possible to generate persistent, interactive 3D environments from a single image or text prompt. Meanwhile, other players like Genie 3 (DeepMind) are pushing forward on real-time video that users can explore like it’s a game. We expect the applications of these world models in gaming, entertainment, simulation, and training autonomous systems will be enormous.

Open source gains ground. The closed vs. open source model debate is heating up in generative media. Enterprises are increasingly gravitating toward open-source models for production not because they’re cheaper, but because they’re customizable: when you need brand consistency, character persistence, or product fidelity across millions of generated assets, finetuning on your own data isn’t optional, it’s the whole game. Closed APIs generally don’t allow that, or offer it in very constrained ways. Meanwhile, open-source models like Flux and Qwen Image Edit closed the quality gap faster than anyone expected in 2025.

About the Contributors

Jennifer Li

is a general partner at Andreessen Horowitz, where she leads infrastructure investments with an eye on data systems, developer tools and AI.

Justine Moore

is a partner on the investing team at Andreessen Horowitz, where she focuses on AI — both foundation models and applications.

Want More a16z Infra?

Analysis and news covering the latest trends reshaping AI and infrastructure.

Learn More

Recommended For You

Infra

Your Data Agents Need Context

Jason Cui and Jennifer Li

Infra

I Built TetrisBench, Where LLMs Compete at Playing Tetris. Here’s What I Found.

Yoko Li

Infra

Most People Can’t Vibe Code. Here’s How We Fix That.

Justine Moore

Infra

It’s time for agentic video editing

Justine Moore

Infra

Matt Bornstein

Martin Casado

Want More Infra?

Analysis and news covering the latest trends reshaping AI and infrastructure.

Views expressed in “posts” (including podcasts, videos, and social media) are those of the individual a16z personnel quoted therein and are not the views of a16z Capital Management, L.L.C. (“a16z”) or its respective affiliates. a16z Capital Management is an investment adviser registered with the Securities and Exchange Commission. Registration as an investment adviser does not imply any special skill or training. The posts are not directed to any investors or potential investors, and do not constitute an offer to sell — or a solicitation of an offer to buy — any securities, and may not be used or relied upon in evaluating the merits of any investment.

The contents in here — and available on any associated distribution platforms and any public a16z online social media accounts, platforms, and sites (collectively, “content distribution outlets”) — should not be construed as or relied upon in any manner as investment, legal, tax, or other advice. You should consult your own advisers as to legal, business, tax, and other related matters concerning any investment. Any projections, estimates, forecasts, targets, prospects and/or opinions expressed in these materials are subject to change without notice and may differ or be contrary to opinions expressed by others. Any charts provided here or on a16z content distribution outlets are for informational purposes only, and should not be relied upon when making any investment decision. Certain information contained in here has been obtained from third-party sources, including from portfolio companies of funds managed by a16z. While taken from sources believed to be reliable, a16z has not independently verified such information and makes no representations about the enduring accuracy of the information or its appropriateness for a given situation. In addition, posts may include third-party advertisements; a16z has not reviewed such advertisements and does not endorse any advertising content contained therein. All content speaks only as of the date indicated.

Under no circumstances should any posts or other information provided on this website — or on associated content distribution outlets — be construed as an offer soliciting the purchase or sale of any security or interest in any pooled investment vehicle sponsored, discussed, or mentioned by a16z personnel. Nor should it be construed as an offer to provide investment advisory services; an offer to invest in an a16z-managed pooled investment vehicle will be made separately and only by means of the confidential offering documents of the specific pooled investment vehicles — which should be read in their entirety, and only to those who, among other requirements, meet certain qualifications under federal securities laws. Such investors, defined as accredited investors and qualified purchasers, are generally deemed capable of evaluating the merits and risks of prospective investments and financial matters.

There can be no assurances that a16z’s investment objectives will be achieved or investment strategies will be successful. Any investment in a vehicle managed by a16z involves a high degree of risk including the risk that the entire amount invested is lost. Any investments or portfolio companies mentioned, referred to, or described are not representative of all investments in vehicles managed by a16z and there can be no assurance that the investments will be profitable or that other investments made in the future will have similar characteristics or results. A list of investments made by funds managed by a16z is available here: https://a16z.com/investments/. Past results of a16z’s investments, pooled investment vehicles, or investment strategies are not necessarily indicative of future results. Excluded from this list are investments (and certain publicly traded cryptocurrencies/ digital assets) for which the issuer has not provided permission for a16z to disclose publicly. As for its investments in any cryptocurrency or token project, a16z is acting in its own financial interest, not necessarily in the interests of other token holders. a16z has no special role in any of these projects or power over their management. a16z does not undertake to continue to have any involvement in these projects other than as an investor and token holder, and other token holders should not expect that it will or rely on it to have any particular involvement.

With respect to funds managed by a16z that are registered in Japan, a16z will provide to any member of the Japanese public a copy of such documents as are required to be made publicly available pursuant to Article 63 of the Financial Instruments and Exchange Act of Japan. Please contact compliance@a16z.com to request such documents.

For other site terms of use, please go here. Additional important information about a16z, including our Form ADV Part 2A Brochure, is available at the SEC’s website: http://www.adviserinfo.sec.gov.

The Latest

new Investing in Treeline

new Every Building You’ve Ever Been In Was Designed By Software Built in 1997

new Investing in Glimpse

new The Algorithm That Keeps Compounding

The State of Generative Media 2026

1) No one model to rule them all

2) From inference to orchestration

3) Not all pixels are worth the same

4) Adoption is everywhere (but some industries are moving faster than others)

5) What comes next?

Jennifer Li

Justine Moore

Your Data Agents Need Context

I Built TetrisBench, Where LLMs Compete at Playing Tetris. Here’s What I Found.

Most People Can’t Vibe Code. Here’s How We Fix That.

Your Data Agents Need Context

I Built TetrisBench, Where LLMs Compete at Playing Tetris. Here’s What I Found.

Most People Can’t Vibe Code. Here’s How We Fix That.

It’s time for agentic video editing

Matt Bornstein

Your Data Agents Need Context

I Built TetrisBench, Where LLMs Compete at Playing Tetris. Here’s What I Found.

Most People Can’t Vibe Code. Here’s How We Fix That.

It’s time for agentic video editing

Matt Bornstein

State of Consumer AI 2025: Product Hits, Misses, and What’s Next

The Cinderella “Glass Slipper” Effect: Retention Rules in the AI Era

State of AI: An Empirical 100 Trillion Token Study with OpenRouter

Bitter Economics

Search Wars: Episode 2

Where you build is who you are: the ElevenLabs story

There is no God Tier video model: But there is something better

Want More Infra?

The State of Generative Media 2026

1) No one model to rule them all

2) From inference to orchestration

3) Not all pixels are worth the same

4) Adoption is everywhere (but some industries are moving faster than others)

5) What comes next?

Jennifer Li

Justine Moore

Want More Infra?

Power User Menu