The Question Everyone’s Asking
At Meta’s @Scale conference in June 2026, Boris Cherny — the creator of Claude Code — got a pointed question from the audience: are loops the next hype cycle, or are they for real?
His answer was immediate.
Yes, they’re for real.
That’s not a throwaway soundbite. Cherny is one of the more credible voices in AI developer tooling, and his framing was precise: two years ago, humans wrote code. Then agents started writing code. Now, agents are prompting agents that write code. Each step, he argued, is as significant as the last.
That’s a bold claim. Let’s stress-test it.
What an AI Loop Actually Is

Before the hype gets too thick, it helps to be clear about what we’re talking about.
An AI agent loop is a continuous, autonomous workflow where one or more agents run indefinitely — not waiting for a human prompt, not stopping at task completion, just… looping. In Cherny’s own setup, one agent perpetually hunts for architectural improvements in a codebase. Another looks for duplicated abstractions to unify. Both submit pull requests like any other contributor and never stop running.
It sounds almost mundane when you put it that way. But the implications are significant.
This Isn’t Entirely New
Recursive loops are a staple of intro computer science. A function calls itself, repeats an action, and stops when a condition is met. AI loops follow the same basic logic — with one key difference: the stopping condition is non-deterministic. A subagent decides when the loop is done, not a hard-coded rule.
That shift from deterministic to judgment-based termination is where things get interesting — and occasionally chaotic.
The Ralph Loop and Other Tricks
One of the more popular patterns already circulating among developers is the Ralph Loop — named, brilliantly, after Ralph Wiggum from The Simpsons.
The mechanic is almost embarrassingly simple: summarize everything the model has done so far, then ask it whether it’s accomplished its goal. It’s a reset mechanism for when long-running agents lose the plot. Bounce the model back to its objective, let it reassess, keep going.
It’s not elegant. It works.
This kind of pragmatic, slightly hacky ingenuity is a good sign. When developers are already building folklore around a technique, it usually means the technique is solving a real problem.
The Test-Time Compute Angle

There’s a deeper structural reason loops are gaining traction, and it connects to a broader trend in AI research.
OpenAI researcher Noam Brown made the observation recently that contemporary models can solve nearly any problem if you throw enough compute at them at inference time. The implication is straightforward: if you can’t make the model smarter, you can make it try harder — and longer.
Agent loops are essentially a productized version of that insight. For hill-climbing problems — like improving a codebase incrementally — there’s no natural finish line. The model can just keep making things better until you tell it to stop, or until the budget runs out.
That’s a genuinely new kind of software development workflow. And it maps cleanly onto the kinds of problems that are hardest to scope in advance.
The Uncomfortable Part: Cost
Here’s where the enthusiasm needs a reality check.
Agent loops burn tokens fast. Much faster than a chatbot answering a question. And because the entire point is to keep the loop running continuously, there’s no natural ceiling on spend. It’s compute-on-tap, indefinitely.
For Anthropic, that’s a business model. For everyone else, it’s a budget conversation.
The honest framing is this: loops are powerful for the right problems, with the right guardrails around token spend, drift detection, and output oversight. Without those guardrails, you’re not running an intelligent loop — you’re running an expensive one.
The tooling to manage this responsibly is still catching up to the ambition.
What This Means for the Ecosystem
The shift from single-agent to multi-agent to looping-agent workflows is compressing fast. For founders and developers watching the AI tools space, a few things are worth tracking:
New tool categories are emerging. Loop orchestration, agent monitoring, drift detection, and token budget management are all becoming real product surfaces — not just engineering concerns.
The trust bar is rising. Authorizing a swarm of agents to work continuously in the background is a fundamentally different relationship with AI than reviewing a single output. Tools that help humans stay in the loop — without being in every loop — will matter.
Pricing models will shift. Flat-rate subscriptions don’t fit continuous compute consumption well. Expect more usage-based pricing, and more pressure on teams to instrument their AI workflows like infrastructure.
The Verdict
Are AI agent loops hype? Partially — the breathless framing will get ahead of the reality for a while, as it always does.
Are they real? Also yes. The underlying mechanics are sound, the use cases for continuous improvement workflows are genuine, and the people building with them are already developing practical patterns.
The smarter question isn’t whether loops matter. It’s whether your team has the infrastructure — and the discipline — to run them without the costs quietly eating you alive.
Observe carefully. Choose the loop that has an off switch.
Comments (0) No comments yet
Want to join this discussion? Login or Register.
No comments yet. Be the first to share your thoughts!