Infra

It’s time for agentic video editing

Justine Moore Posted January 21, 2026

2025 was the year of video. AI-generated ads went mainstream. Launch videos from seed stage startups got millions of views. Video podcasts and interviews exploded.

What you didn’t see was all the work behind the scenes. Cutting 90 minutes of footage into a three-minute short. Correcting lighting and audio in post-production because you couldn’t nail it in the shoot. Searching for the right music and sound effects.

A common rule of thumb in video production is that you’ll spend 80% of your time & energy on editing, and 20% on filming (or now, generating). Crafting compelling video is typically a long and tedious process – and few people have the “taste” to do it right. There’s a significant barrier to entry.

We now have the technology to hand over some of this work to AI agents, which can help us produce both filmed and generated content. Vision models can watch and comprehend massive amounts of video footage. Agents can analyze, plan, and use editing tools on your behalf. And we have enough training data to teach models what makes a video great.

Video agents will blow out the supply curve for quality video – the kind of content that requires days (or weeks) from professional video editors today. What Cursor did for coding, these agents will do for video production.

Why now?

There’s immense demand for agents that give anyone the skills (and taste) of a professional video editor. So why don’t these products already exist? There have been a few recent developments that have unlocked progress here:

 

  1. Vision models can now process large amounts of video. You have to understand video before you can edit it. This is a non-trivial challenge – there’s a lot of information to process, even in a short clip. We’ve seen a lot of progress with recent LLMs like Gemini 3, GPT-5.2, Molmo 2, and Vidi2, which are inherently multimodal and have longer context windows. Gemini 3 can now process up to an hour of video! You can upload it as an input and ask the model to generate timestamped labels, find a specific moment, or just summarize what’s happening.
  1. Models can now use tools. AI video editors need to be able to take action – not just describe what’s happening or suggest changes. We’re starting to see meaningful progress around LLMs as real agents that can use tools. One of my favorite examples of this is Claude using Blender (a notoriously tricky product that many humans haven’t mastered). You can imagine how this evolves as you give agents access to more tools.
  1. Image and video generation models have improved. I’m a big believer that many video production pipelines will be hybrid – a mix of AI and filmed content. Imagine filming interviews for a documentary, but generating B-roll or historical footage with AI. Or using a motion transfer model to take a reference animation and apply it to a real character. For any of these things to work, models needed to reach a level of quality & consistency to be valuable. Now, that’s finally happening.

What will these agents do?

A few examples of the types of tasks they’ll be able to tackle for us:

 

  1. Process – whether you’re filming or generating a video, you’ll likely end up with a lot more footage than you need (sometimes by a factor of hundreds – imagine how many “takes” there are for each scene of a movie or TV show). It’s often a challenge to sort through all this footage, organize it, and decide what to use. Products like Eddie AI can take hours of uploaded video and do things like identifying A-roll vs. B-roll, processing multiple camera angles, and comparing takes.

     

  2. Orchestrate – if we assume that many videos are going to include some element of AI in the future, we’re going to need agents that orchestrate all of the models. For example – imagine you want to add an AI animation to an educational video. You’ll need an agent that can generate the images, send them to a video model, and stitch the outputs together. Products like Glif are launching agents that coordinate between multiple models on a user’s behalf.

    https://x.com/venturetwins/status/2010432732149788778

  3. Polish – fixing the small details often end up taking a video from good to great. But if you’re not a professional video editor, you may be overwhelmed by the flood of little tasks needed to polish a video. For example – you may want to adjust lighting between clips, clean noise out of the audio track, or take out filler words (“ummms” and “uhhhs”) during an interview. Products like Descript’s Underlord agent can take a video, make all these changes for you, and deliver the final version.

    https://x.com/venturetwins/status/1930332407603048691

  4. Adapt – when you make a good video, it often makes sense to adapt it for more reach. For example, you may want to cut a YouTube podcast into short clips with different aspect ratios to post on your X, Instagram, and TikTok accounts. Or even translate a video into other languages (and re-dub the speakers) to reach an international audience. Platforms like Overlap allow you to set up node workflows for these adaptation tasks.
  5. Optimize – the ultimate goal isn’t just replacing manual tasks with AI. It’s building agents with taste that can make your videos better. There’s a reason people hire professional video editors: they make things look good. They spend years learning everything from how to hook viewers to pacing the storyline and using music to build an emotional reaction. There are thousands of micro-decisions there. YouTuber Emma Chamberlain famously said that she used to spend 30-40 hours editing a ~15 minute vlog.What if an AI agent could watch your footage, ask about your objectives, and then craft a few draft versions of a video for you to iterate on? You review and direct – “The opening is too slow.” “Cut the middle section.” “Make the ending hit harder” – and the agent executes.

 

Video has won. It’s how we learn, market, and connect. But the editing bottleneck keeps growing. More footage captured, more platforms to publish on, more formats required.

The good news is that technology to solve this exists. Vision models, tool-using agents, and massive amounts of training data have all matured in the past year. The pieces are in place.

This means that AI editing agents will dramatically increase the quality of all video we see in the coming months and years, along with the rate at which it’s created.

2025 was the year of video. 2026 is the year we let agents edit it.

If you’re building anything in the AI video space reach out on X at @venturetwins or email at jmoore@a16z.com. I especially want to talk if you’re building agents that can help me edit my companion video to this blogpost (see below!).

Want More a16z Infra?

Analysis and news covering the latest trends reshaping AI and infrastructure.

Learn More
Recommended For You

Want More Infra?

Analysis and news covering the latest trends reshaping AI and infrastructure.

Sign Up On Substack

Views expressed in “posts” (including podcasts, videos, and social media) are those of the individual a16z personnel quoted therein and are not the views of a16z Capital Management, L.L.C. (“a16z”) or its respective affiliates. a16z Capital Management is an investment adviser registered with the Securities and Exchange Commission. Registration as an investment adviser does not imply any special skill or training. The posts are not directed to any investors or potential investors, and do not constitute an offer to sell — or a solicitation of an offer to buy — any securities, and may not be used or relied upon in evaluating the merits of any investment.

The contents in here — and available on any associated distribution platforms and any public a16z online social media accounts, platforms, and sites (collectively, “content distribution outlets”) — should not be construed as or relied upon in any manner as investment, legal, tax, or other advice. You should consult your own advisers as to legal, business, tax, and other related matters concerning any investment. Any projections, estimates, forecasts, targets, prospects and/or opinions expressed in these materials are subject to change without notice and may differ or be contrary to opinions expressed by others. Any charts provided here or on a16z content distribution outlets are for informational purposes only, and should not be relied upon when making any investment decision. Certain information contained in here has been obtained from third-party sources, including from portfolio companies of funds managed by a16z. While taken from sources believed to be reliable, a16z has not independently verified such information and makes no representations about the enduring accuracy of the information or its appropriateness for a given situation. In addition, posts may include third-party advertisements; a16z has not reviewed such advertisements and does not endorse any advertising content contained therein. All content speaks only as of the date indicated.

Under no circumstances should any posts or other information provided on this website — or on associated content distribution outlets — be construed as an offer soliciting the purchase or sale of any security or interest in any pooled investment vehicle sponsored, discussed, or mentioned by a16z personnel. Nor should it be construed as an offer to provide investment advisory services; an offer to invest in an a16z-managed pooled investment vehicle will be made separately and only by means of the confidential offering documents of the specific pooled investment vehicles — which should be read in their entirety, and only to those who, among other requirements, meet certain qualifications under federal securities laws. Such investors, defined as accredited investors and qualified purchasers, are generally deemed capable of evaluating the merits and risks of prospective investments and financial matters.

There can be no assurances that a16z’s investment objectives will be achieved or investment strategies will be successful. Any investment in a vehicle managed by a16z involves a high degree of risk including the risk that the entire amount invested is lost. Any investments or portfolio companies mentioned, referred to, or described are not representative of all investments in vehicles managed by a16z and there can be no assurance that the investments will be profitable or that other investments made in the future will have similar characteristics or results. A list of investments made by funds managed by a16z is available here: https://a16z.com/investments/. Past results of a16z’s investments, pooled investment vehicles, or investment strategies are not necessarily indicative of future results. Excluded from this list are investments (and certain publicly traded cryptocurrencies/ digital assets) for which the issuer has not provided permission for a16z to disclose publicly. As for its investments in any cryptocurrency or token project, a16z is acting in its own financial interest, not necessarily in the interests of other token holders. a16z has no special role in any of these projects or power over their management. a16z does not undertake to continue to have any involvement in these projects other than as an investor and token holder, and other token holders should not expect that it will or rely on it to have any particular involvement.

With respect to funds managed by a16z that are registered in Japan, a16z will provide to any member of the Japanese public a copy of such documents as are required to be made publicly available pursuant to Article 63 of the Financial Instruments and Exchange Act of Japan. Please contact compliance@a16z.com to request such documents.

For other site terms of use, please go here. Additional important information about a16z, including our Form ADV Part 2A Brochure, is available at the SEC’s website: http://www.adviserinfo.sec.gov.