The Vibe Coding Arc: From Karpathy's Tweet to the May 2026 Successor
Fifteen months from a single tweet to an industry-wide replacement term. Here is the month-by-month timeline of vibe coding's rise, peak, and quiet absorption into agentic engineering.
In February 2025, Andrej Karpathy posted a 280-character description of how he was writing code. By April 2026, the same Karpathy was on stage at Sequoia calling the same practice "passé" and introducing the successor term. In between, vibe coding became a Collins Dictionary Word of the Year contender, leaked 1.5 million API keys, spawned at least six unicorn-valued tools, and was quietly absorbed into a broader paradigm called agentic engineering.
The arc lasted fifteen months. The SERP is still six weeks behind it. Here's the complete timeline.
February 3, 2025: the tweet
Karpathy's original post is so short it fits in a footnote:
"There's a new kind of coding I call 'vibe coding', where you fully give in to the vibes, embrace exponentials, and forget that the code even exists."
He goes on to describe accepting Cursor's suggestions without reading them, asking the model to fix bugs by pasting error messages back, and "barely touching the keyboard." Simon Willison's annotation — published two days later — is the most-cited interpretation: vibe coding is a workflow for "throwaway weekend projects," not a software engineering methodology. That distinction would matter later.
The tweet was 26 million views by week-end. By month-end, "vibe coding" had a Wikipedia page.
March – May 2025: the tool wave
Tools that already existed got rebranded. Tools that didn't yet exist shipped fast. The vibe-coding tool stack of spring 2025 was:
- Bolt.new — text-to-fullstack-app, Stackblitz's bet. Hit $40M ARR by July 2025.
- Lovable — Swedish "vibe-code your SaaS" tool. Hit $50M ARR before its first birthday.
- Replit Agent — the established player, repositioning hard.
- Cursor Composer — the Pro-tier multi-file vibe loop.
- v0 (Vercel) — for React/Next.js component generation.
- Codeium / Windsurf — the IDE-native vibe assistant.
June – August 2025: the consultancy bubble
By summer, vibe coding had a consulting class. Every AI consultancy added "vibe coding workshop" to their service menu. Y Combinator's Summer 2025 batch was 44% AI-first. Sequoia, a16z, and Greylock published think pieces. The phrase "vibe coding" got into the speaker bios at three different conferences.
This is also when the criticism started landing. Uncle Bob's blog post — "Vibe coding is not the same as disciplined agentic development" — was the first widely-shared dissent from a respected source. The HN response thread, "Vibe coding: what is it good for? Absolutely nothing", pulled 600+ comments. Even people who liked it conceded the failure modes.
September 2025: AGENTS.md and the discipline turn
September is the inflection. Three things shipped:
AGENTS.md launches. OpenAI's Codex CLI ships with a config file at the project root calledAGENTS.md. It's tool-agnostic, OpenAI-led but published as an open standard, and by the end of the month it has 82 commits and 60,000+ projects using it (HN id 44957443, 382 comments). The point: developers are willing to write docs for agents that they wouldn't write for humans. This is the first sign that the industry wants more discipline, not less.
Spec Kit ships. GitHub publishes a spec-first methodology for AI coding: "specify → plan → tasks → implement." The whole point is the opposite of vibe coding — formal spec before the model writes anything.
Superpowers launches. Jesse Vincent (Prime Radiant) publishes a Claude Code methodology: brainstorm → plan → implement, with subagent-driven development, TDD discipline, and explicit verification. The HN original post and a rave review thread bring it into the mainstream. By April 2026 the repo is at 120k+ stars.
The discipline turn isn't coordinated, but it's unmistakable. The tools that ship in September 2025 all share an assumption: the agent works better when you tell it more, not less.
October – November 2025: the security wake-up
In October, Vibehouse leaks 1.5 million API keys. It's a vibe-coded admin panel with no auth, exposed publicly, scraped by a researcher who reported it before bad actors found it. The number gets cited everywhere. By December, Veracode's 2025 report says 45% of AI-generated code introduces security vulnerabilities.
The narrative shifts. Vibe coding is fine for a weekend prototype. Vibe coding is not fine for the thing that has your customers' payment data. Every blog post about vibe coding for the next quarter has a paragraph that starts "but for production…".
December 2025: Karpathy inverts
In a year-end thread, Karpathy admits his own ratio inverted. Where he used to write 80% of his own code and delegate 20% to agents, he now delegates 80% and writes 20%. The phrasing is casual but the implication is loud: even the term's inventor isn't doing pure vibe coding anymore. The replies fill with people saying the same thing happened to them in November.
January 2026: AGENTS.md becomes a standard
Linux Foundation announces the Agentic AI Foundation. AGENTS.md is one of the foundation's first stewardship projects. Tool adoption: Codex, Cursor, Copilot, Gemini, Jules, Amp, Factory all read AGENTS.md natively. The "vibe code your app" era was about removing structure. The AGENTS.md era is about putting it back, in a form the model can read.Two of the three biggest LLM eval tools also get acquired in this window — Langfuse → ClickHouse in January, Promptfoo → OpenAI in March. Eval and observability mature in parallel to the discipline turn. (We covered the implications in a separate post.)
March 7, 2026: autoresearch
Karpathy publishes a new repo: autoresearch. The announcement tweet hits 8.6 million views in two days. The repo is 630 lines and explicitly says "you don't use it directly — you give it to your agent." Within six weeks, four Claude Code ports exist with combined 17k+ stars:
- uditgoenka/autoresearch — 4.4k stars, cleanest port.
- wanshuiyin/ARIS — 9k stars, with review-and-revise loop.
- drivelineresearch/... — technical/scientific port.
- Maleick/autoresearch-claude — long-form report generator.
April 2026: Sequoia calls it
At Sequoia's AI Ascent 2026, Stephanie Zhan introduces Karpathy and uses the phrase that ends the era. Paraphrasing the conversation:
"Vibe coding raised the floor. Agentic engineering raises the ceiling."
Karpathy agrees, then is more direct: vibe coding is "passé." The replacement is agentic engineering: same AI tooling, more discipline, treating the agent like a junior developer who needs context, specs, evals, and review — not like a magic wand. Simon Willison's May 6 follow-up is the canonical synthesis: the same tools work for both, but you ship better software with the discipline.
May 2026: where we are
The vibe coding era is over in vocabulary terms — Karpathy moved on, Simon Willison moved on, the discourse moved on. But it's not over in practice. Bolt, Lovable, v0, Cursor Composer, and Replit Agent are all bigger than they were a year ago. The tools are the same; the methodology around them changed.
Here is the tool map for May 2026. Where each tool sits on the vibe ↔ discipline axis:
| Phase | Tools | What they reward |
|---|---|---|
| Pure vibe (Feb 2025) | Cursor accept-all, Replit Agent, Bolt.new, Lovable, v0 | Speed-to-prototype |
| Vibe + Cursor rules (Spring 2025) | Cursor with .cursorrules, Windsurf with .windsurfrules | Speed + constraints |
| Methodology overlay (Sept 2025) | Claude Code + CLAUDE.md, AGENTS.md, Spec Kit, Superpowers | Repeatable shipping |
| Discipline + autonomy (March 2026) | Claude Code + skills + autoresearch + subagents | Long-horizon work |
| Agentic engineering (May 2026) | Methodology + skills + evals + subagents + cross-tool configs | Production systems |
What replaced vibe coding (with specifics)
The "agentic engineering" successor is the umbrella term. Underneath it, three specific practices replaced the parts of vibe coding that broke.
1. Configuration files replaced ad-hoc prompts. CLAUDE.md, AGENTS.md, SKILL.md — three files that encode the project context the agent needs. The HumanLayer rule is that CLAUDE.md should be under 60 lines, because frontier models handle ~150-200 instructions reliably and lose discipline past that. The whole point is the opposite of vibe: tell the agent more, explicitly, ahead of time. 2. Skills replaced one-shot prompts. Anthropic's Claude Code skills system is the model-native version: a skill is a triggered context bundle that loads only when needed. The result is a 4-line context budget instead of a 400-line one. An always-on rule is a waste of context. A triggered skill is a precision instrument. RuleSell exists because the gap between "I have a skill" and "I have a skill that's been audited and won't break my codebase" is real. 3. Eval and review replaced "ship it and see." Both Hamel Husain and Shreya Shankar have been saying the same thing for two years: 60-80% of dev time on error analysis, review at least 100 traces, build the annotation UI into your app. The two eval-tool acquisitions in early 2026 (Langfuse, Promptfoo) made the tooling story messier but the principle is unchanged.The five-step upgrade path
If you started in vibe coding and want to land in agentic engineering, here is the path that worked for the teams we've watched do it:
Step 1: Write a 50-line CLAUDE.md. Stack, conventions, deny-rules for what the agent should never touch. Done. Step 2: Convert your favorite repeat-prompts into skills. If you've typed "write a Next.js API route with Zod validation and Prisma" more than three times, that's a skill. See the build a Claude Code skill guide. Step 3: Add an AGENTS.md if your team uses more than one agent tool. The decision tree is short: one tool → CLAUDE.md only. Multiple tools → AGENTS.md primary, CLAUDE.md pointer. Step 4: Add evals for the part that ships to customers. Not for everything — for the customer-facing LLM features. DeepEval or Inspect AI are the open-source picks since Promptfoo got acquired. Step 5: Layer in autoresearch or subagents for long-horizon work. If you have a question that needs four hours of investigation, autoresearch is the pattern. If you have a task that has four parallel parts, subagents is the pattern. Don't add either before steps 1-4 are in place.Where this analysis fails
We don't actually know if "agentic engineering" sticks as a term. Karpathy says it, Sequoia says it, Simon Willison says it. That's three points. The 2025 industry adoption of "vibe coding" had hundreds of points within weeks. The phrase could fade by August.
We also don't know which tools survive. Bolt, Lovable, and v0 are in different shapes than they were six months ago. Claude Code is gaining share. Cursor is defending. Aider is having a moment with the Polyglot benchmark. The substrate is moving faster than the methodology.
The thing we are confident about: the tools that get installed alongside discipline (eval, configs, skills, subagents) compound. The tools that get installed instead of discipline don't. That is the load-bearing finding from fifteen months of watching the arc.
What to read next
- /topic/agentic-engineering — the methodology, deep dive
- /topic/yolo-mode — the
--dangerously-skip-permissionsplaybook, safely - /topic/autoresearch — the autonomous research pattern
- /topic/claude-md — the 4-line CLAUDE.md that outranks the 400-line one
- /for/solo-developers — agentic engineering for one-person teams
- /for/startups — shipping AI features in your first 30 days with discipline
Sources
- Karpathy's vibe coding tweet (Feb 3, 2025) — x.com/karpathy/status/1886192184808149383
- Simon Willison annotation — simonwillison.net/2025/Feb/3/vibe-coding
- Vibe coding HN dissent — news.ycombinator.com/item?id=46064998
- AGENTS.md HN launch — news.ycombinator.com/item?id=44957443
- Superpowers original post — news.ycombinator.com/item?id=45547344
- Karpathy autoresearch tweet (March 7, 2026) — x.com/karpathy/status/2030371219518931079
- Sequoia AI Ascent 2026 — sequoiacap.com/podcast/karpathy-agentic-engineering
- Simon Willison May 2026 synthesis — simonwillison.net/2026/May/6/vibe-coding-and-agentic-engineering
- Veracode 2025 — veracode.com/state-of-software-security-2025
- Hamel Husain evals FAQ — hamel.dev/blog/posts/evals-faq