The Model Is Only 10%: The Real Lesson of the New SDLC

📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent Google whitepaper emphasizes that in AI-assisted software development, the model itself is only 10% of the system. The focus should be on harness design and context engineering, which account for 90%. This shifts how organizations should invest in AI development.

A new Google whitepaper titled The New SDLC With Vibe Coding asserts that the most significant shift in software engineering is moving from writing code to expressing intent and trusting machines to generate software. The report emphasizes that the AI model used in development accounts for only about 10% of the overall system behavior, with the remaining 90% determined by the harness, including prompts, tools, rules, and context management.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, presents data showing that most AI-generated code relies heavily on the surrounding infrastructure rather than the model itself. For example, experiments on public benchmarks like Terminal Bench 2.0 demonstrated that simply modifying the harness — such as prompts, tools, or configuration — could dramatically improve performance, even when using the same AI model.

The authors argue that failures in AI agents are predominantly configuration issues, such as missing tools or vague rules, rather than flaws in the models. They emphasize that organizations should focus on building and owning their harnesses, which include the prompts, rules, and observability layers, to gain a durable advantage. The paper also stresses that context engineering — providing the right instructions, knowledge, examples, and guardrails — is more critical than clever prompting alone.

Economically, the whitepaper suggests that while vibe coding (quick prompts with minimal review) appears cheap, it incurs higher long-term costs due to token inefficiency, maintenance, and security vulnerabilities. In contrast, disciplined approaches like agentic engineering require higher initial investment but reduce marginal costs over time.

At a glance
reportWhen: published early 2026
The developmentThe whitepaper by Addy Osmani, Shubham Saboo, and Sokratis Kartakis highlights that the core of effective AI coding lies in the harness and context engineering, not the AI model itself.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Implications for AI Development Strategies

This whitepaper fundamentally shifts the understanding of where value lies in AI-assisted software engineering. Instead of focusing solely on acquiring the latest models, organizations should prioritize developing robust harnesses and context management. This approach offers a more cost-effective, reliable, and secure path to integrating AI into software workflows, especially as AI-generated code now constitutes up to 41% of new code.

For CTOs and engineering leaders, this means reevaluating current investments and emphasizing infrastructure, tooling, and process discipline. The shift could lead to more sustainable AI adoption, better security, and improved code quality, ultimately influencing how AI tools are integrated into enterprise development pipelines.

Amazon

AI harness design tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background of AI in Software Engineering

By early 2026, AI coding agents are used by 85% of professional developers, with 51% using them daily, and roughly 41% of all new code being AI-generated, according to the whitepaper. Prior to this, the industry largely viewed AI models as the core driver of code quality. However, recent experiments and industry observations have shown that the surrounding infrastructure — prompts, tools, and context management — significantly impacts AI performance.

The concept of vibe coding, popularized in early 2025, involved minimal prompts and quick fixes, often leading to unreliable outputs. The whitepaper clarifies that this approach, while seemingly cheap, leads to high long-term costs and risks. The emergence of ‘agentic engineering’ as a disciplined approach, with structured verification and context management, marks a pivotal evolution in AI development practices.

“The model accounts for only about 10% of the behavior; the harness and context engineering make up the other 90%.”

— Addy Osmani

Amazon

context engineering software development

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unresolved Questions About Implementation

While the whitepaper convincingly argues that harness and context engineering are more impactful than the model itself, it does not specify how organizations should best structure their workflows or measure success in this shift. The practical steps for transitioning from vibe coding to disciplined engineering are still being developed, and the long-term effects on security and maintenance costs require further observation.

Amazon

AI prompt engineering tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for AI-Driven Software Development

Organizations are expected to begin reevaluating their AI strategies, prioritizing the development of robust harnesses and context management frameworks. Future research and industry practice will likely focus on establishing standards for context engineering, tooling, and verification processes. Additionally, companies may invest in training and tooling to better implement agentic engineering, aiming to reduce long-term costs and improve security.

Amazon

observability layers for AI systems

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system’s behavior?

The whitepaper explains that most of the AI agent’s output depends on how it is configured, including prompts, tools, rules, and context management, rather than the underlying model alone.

What is harness engineering in AI development?

Harness engineering involves designing and managing the surrounding infrastructure—prompts, rules, tools, and observability—that guides the AI model’s behavior effectively.

How does this shift impact AI development costs?

While vibe coding appears inexpensive initially, it often leads to higher long-term costs due to inefficiencies, maintenance, and security risks. Disciplined harness and context engineering may require higher upfront investment but reduce ongoing expenses.

What is the significance of context engineering?

Context engineering ensures the AI has the right information, instructions, and safeguards, which significantly improves output quality and reliability over simple prompt tricks.

What are the next steps for organizations adopting AI in software engineering?

Organizations should focus on building and owning their harnesses, developing structured verification processes, and training teams in context engineering to maximize AI benefits and control long-term costs.

Source: ThorstenMeyerAI.com

You May Also Like

After the Paycheck: The Book I Wrote Because Nobody Else Would Tell the Truth About AI and Your Income

Author Thorsten Meyer releases ‘After the Paycheck,’ analyzing AI’s effect on jobs, ownership, and the economy, emphasizing ownership as the key to future stability.

World Model Readiness: Are You Ready for AI That Acts?

Assess your organization’s readiness for AI systems capable of predicting and acting, as world models become central to AI development and deployment.

Engineering Is Automated. Research Is the Residual.

Recent developments show AI has automated much of engineering work, leaving research as the remaining challenge. This shift could reshape AI development timelines.

The stake. Why the answer to automation is broad-based ownership, not a bigger transfer.

Thorsten Meyer argues that expanding capital ownership, not redistribution, is the market-friendly solution to AI-driven value shifts from labor to capital.