📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent Google whitepaper emphasizes that in AI-assisted software development, the model itself is only 10% of the system. The focus should be on harness design and context engineering, which account for 90%. This shifts how organizations should invest in AI development.

A new Google whitepaper titled The New SDLC With Vibe Coding asserts that the most significant shift in software engineering is moving from writing code to expressing intent and trusting machines to generate software. The report emphasizes that the AI model used in development accounts for only about 10% of the overall system behavior, with the remaining 90% determined by the harness, including prompts, tools, rules, and context management.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, presents data showing that most AI-generated code relies heavily on the surrounding infrastructure rather than the model itself. For example, experiments on public benchmarks like Terminal Bench 2.0 demonstrated that simply modifying the harness — such as prompts, tools, or configuration — could dramatically improve performance, even when using the same AI model.

The authors argue that failures in AI agents are predominantly configuration issues, such as missing tools or vague rules, rather than flaws in the models. They emphasize that organizations should focus on building and owning their harnesses, which include the prompts, rules, and observability layers, to gain a durable advantage. The paper also stresses that context engineering — providing the right instructions, knowledge, examples, and guardrails — is more critical than clever prompting alone.

Economically, the whitepaper suggests that while vibe coding (quick prompts with minimal review) appears cheap, it incurs higher long-term costs due to token inefficiency, maintenance, and security vulnerabilities. In contrast, disciplined approaches like agentic engineering require higher initial investment but reduce marginal costs over time.

At a glance

reportWhen: published early 2026

The developmentThe whitepaper by Addy Osmani, Shubham Saboo, and Sokratis Kartakis highlights that the core of effective AI coding lies in the harness and context engineering, not the AI model itself.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.

The idea worth building your strategy around

Agent = Model + Harness

~10%

HARNESS — prompts · tools · context · hooks · sandboxes · observability

MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.

85%

of devs use AI coding agents (51% daily)

41%

of all new code is AI-generated

~90%

of agent behavior is the harness, not the model

+19%

longer on some tasks (METR) — verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Implications for AI Development Strategies

This whitepaper fundamentally shifts the understanding of where value lies in AI-assisted software engineering. Instead of focusing solely on acquiring the latest models, organizations should prioritize developing robust harnesses and context management. This approach offers a more cost-effective, reliable, and secure path to integrating AI into software workflows, especially as AI-generated code now constitutes up to 41% of new code.

For CTOs and engineering leaders, this means reevaluating current investments and emphasizing infrastructure, tooling, and process discipline. The shift could lead to more sustainable AI adoption, better security, and improved code quality, ultimately influencing how AI tools are integrated into enterprise development pipelines.

Harness Engineering for Web Developers: From Prompts to Context to Autonomous Agent Environment Design (AI for Web Developers Series)

As an affiliate, we earn on qualifying purchases.

Background of AI in Software Engineering

By early 2026, AI coding agents are used by 85% of professional developers, with 51% using them daily, and roughly 41% of all new code being AI-generated, according to the whitepaper. Prior to this, the industry largely viewed AI models as the core driver of code quality. However, recent experiments and industry observations have shown that the surrounding infrastructure — prompts, tools, and context management — significantly impacts AI performance.

The concept of vibe coding, popularized in early 2025, involved minimal prompts and quick fixes, often leading to unreliable outputs. The whitepaper clarifies that this approach, while seemingly cheap, leads to high long-term costs and risks. The emergence of ‘agentic engineering’ as a disciplined approach, with structured verification and context management, marks a pivotal evolution in AI development practices.

“The model accounts for only about 10% of the behavior; the harness and context engineering make up the other 90%.”
— Addy Osmani

Context Engineering for Claude: A Practitioner's Guide to CLAUDE.md, Memory Tools, and Three-Layer Workflows for Solopreneurs, Freelancers, and Product Managers

As an affiliate, we earn on qualifying purchases.

Unresolved Questions About Implementation

While the whitepaper convincingly argues that harness and context engineering are more impactful than the model itself, it does not specify how organizations should best structure their workflows or measure success in this shift. The practical steps for transitioning from vibe coding to disciplined engineering are still being developed, and the long-term effects on security and maintenance costs require further observation.

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

As an affiliate, we earn on qualifying purchases.

Next Steps for AI-Driven Software Development

Organizations are expected to begin reevaluating their AI strategies, prioritizing the development of robust harnesses and context management frameworks. Future research and industry practice will likely focus on establishing standards for context engineering, tooling, and verification processes. Additionally, companies may invest in training and tooling to better implement agentic engineering, aiming to reduce long-term costs and improve security.

Amazon

observability layers for AI systems

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system’s behavior?

The whitepaper explains that most of the AI agent’s output depends on how it is configured, including prompts, tools, rules, and context management, rather than the underlying model alone.

What is harness engineering in AI development?

Harness engineering involves designing and managing the surrounding infrastructure—prompts, rules, tools, and observability—that guides the AI model’s behavior effectively.

How does this shift impact AI development costs?

While vibe coding appears inexpensive initially, it often leads to higher long-term costs due to inefficiencies, maintenance, and security risks. Disciplined harness and context engineering may require higher upfront investment but reduce ongoing expenses.

What is the significance of context engineering?

Context engineering ensures the AI has the right information, instructions, and safeguards, which significantly improves output quality and reliability over simple prompt tricks.

What are the next steps for organizations adopting AI in software engineering?

Organizations should focus on building and owning their harnesses, developing structured verification processes, and training teams in context engineering to maximize AI benefits and control long-term costs.

Source: ThorstenMeyerAI.com

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Cutrova: Edit the Words, Not the Timeline

Author

press-report.net Team

The model is only 10%

Implications for AI Development Strategies

Harness Engineering for Web Developers: From Prompts to Context to Autonomous Agent Environment Design (AI for Web Developers Series)

Background of AI in Software Engineering

Context Engineering for Claude: A Practitioner's Guide to CLAUDE.md, Memory Tools, and Three-Layer Workflows for Solopreneurs, Freelancers, and Product Managers

Unresolved Questions About Implementation

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

Next Steps for AI-Driven Software Development

observability layers for AI systems

Key Questions

Why is the model only 10% of the system’s behavior?

What is harness engineering in AI development?

How does this shift impact AI development costs?

What is the significance of context engineering?

What are the next steps for organizations adopting AI in software engineering?

The Atlas. What the framework is.

AmenGate: The Moment Before The Scroll

The Machine Economy — Capital-Heavy, Human-Light, Trading With Itself

Are AI-Driven Surveillance Systems Compromising City Governance?

Owning The AI Future: SAP’s Strategy Of System Control Over Brain Outsourcing

How AI Is Making Corporate Survival A Never-Ending Show

OpenAI’s Model Breach During Benchmark Raises Security Concerns

Open-Source MiMo Code: Improving AI Operations Signal Tracking

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Author

press-report.net Team

The model is only 10%

Implications for AI Development Strategies

Harness Engineering for Web Developers: From Prompts to Context to Autonomous Agent Environment Design (AI for Web Developers Series)

Background of AI in Software Engineering

Context Engineering for Claude: A Practitioner's Guide to CLAUDE.md, Memory Tools, and Three-Layer Workflows for Solopreneurs, Freelancers, and Product Managers

Unresolved Questions About Implementation

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

Next Steps for AI-Driven Software Development

observability layers for AI systems

Key Questions

Why is the model only 10% of the system’s behavior?

What is harness engineering in AI development?

How does this shift impact AI development costs?

What is the significance of context engineering?

What are the next steps for organizations adopting AI in software engineering?

You May Also Like