The Model Is Only 10%: The Real Lesson of the New SDLC

TL;DR

A recent whitepaper from Google argues that in AI software development, the model itself accounts for only about 10% of system behavior. The key to effective AI engineering lies in harness design and context engineering rather than in model size.

Google’s latest whitepaper on the Software Development Life Cycle (SDLC) with AI coding emphasizes a counterintuitive but crucial insight: the AI model accounts for only about 10% of the system’s behavior. The real focus should be on the harness, context engineering, and configuration, which together determine 90% of the outcome. This shift has significant implications for how organizations develop and maintain AI systems.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, argues that the dominant factor in AI system performance is not the underlying model but the surrounding scaffolding: prompts, rules, tools, and observability. Its headline evidence is a public benchmark result: an agent moved from outside the top 30 to the top 5 on Terminal Bench 2.0 by changing only the harness, with the model held constant.

Furthermore, the authors introduce the concept of ‘agentic engineering,’ where AI is integrated into formal specifications, automated tests, and rigorous verification processes. They argue that this approach, which emphasizes configuration and context, is more cost-effective and scalable than vibe coding — quick prompts and minimal review — which often leads to higher long-term costs due to inefficiencies and vulnerabilities.

At a glance

Report · When: published May 2026

The development: Google’s new whitepaper argues that the critical factor in AI software development is the harness and context engineering, not the AI model itself, marking a paradigm shift in SDLC strategies.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
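
One way to read the distinction above: a test asserts an exact answer for deterministic code, while an eval scores many non-deterministic outputs against a rubric and gates on a pass rate. The sketch below is illustrative only. The names `slugify`, `rubric_ok`, `fake_agent_summaries`, and `ci_gate`, the rubric itself, and the 0.8 threshold are all assumptions made for this example, not anything taken from the whitepaper.

```python
import re


def slugify(title: str) -> str:
    """Deterministic helper: the same input always yields the same output."""
    return re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")


def test_slugify() -> None:
    # A test: exact equality on deterministic behavior.
    assert slugify("The Model Is Only 10%") == "the-model-is-only-10"


def rubric_ok(summary: str) -> bool:
    """Hypothetical rubric: a summary must be short and mention the harness."""
    return len(summary.split()) <= 20 and "harness" in summary.lower()


def eval_pass_rate(outputs: list[str]) -> float:
    """An eval: score many sampled outputs and report the fraction that pass."""
    return sum(rubric_ok(o) for o in outputs) / len(outputs)


# Stand-in for sampled model outputs, hard-coded so the sketch runs offline.
fake_agent_summaries = [
    "The harness drives most agent behavior.",
    "Bigger models always win.",
    "Harness and context matter more than the model.",
    "Change the harness, keep the model, and scores move.",
]


def ci_gate(outputs: list[str], threshold: float = 0.8) -> bool:
    """CI gate: tests must pass and the eval pass rate must clear a threshold."""
    test_slugify()  # raises AssertionError if the deterministic check fails
    return eval_pass_rate(outputs) >= threshold


print(eval_pass_rate(fake_agent_summaries))  # 0.75
print(ci_gate(fake_agent_summaries))         # False: 0.75 is below 0.8
```

In a real pipeline the outputs would come from repeated agent runs and the rubric would usually be richer (for example a grader model or a scoring script), but the shape stays the same: equality for the deterministic parts, a thresholded pass rate for the rest.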

The idea worth building your strategy around

Agent = Model + Harness

Model: ~10%

Harness: prompts · tools · context · hooks · sandboxes · observability

~90% is your surface area, not the provider’s

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
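
To make "Agent = Model + Harness" concrete, here is a minimal sketch that treats the harness as versioned configuration and checks it for the three failure types named in the quote. Everything in it is hypothetical: the `Harness` and `Agent` field names, the `config_issues` helper, the four-word vagueness heuristic, and the five-document context limit are assumptions for illustration, not the whitepaper's definitions.

```python
from dataclasses import dataclass, field


@dataclass
class Harness:
    """Everything around the model (field names are illustrative)."""
    system_prompt: str
    rules: list[str] = field(default_factory=list)
    tools: set[str] = field(default_factory=set)
    context_docs: list[str] = field(default_factory=list)


@dataclass
class Agent:
    model: str        # swappable identifier for whichever model is used
    harness: Harness  # the part the team owns and versions


def config_issues(agent: Agent, required_tools: set[str],
                  max_context_docs: int = 5) -> list[str]:
    """Flag a missing tool, a vague rule, or a noisy context."""
    issues = []
    missing = required_tools - agent.harness.tools
    if missing:
        issues.append(f"missing tool: {', '.join(sorted(missing))}")
    for rule in agent.harness.rules:
        # Crude stand-in for vagueness: very short rules give little guidance.
        if len(rule.split()) < 4:
            issues.append(f"vague rule: {rule!r}")
    if len(agent.harness.context_docs) > max_context_docs:
        issues.append("noisy context: too many documents loaded")
    return issues


agent = Agent(
    model="any-model",
    harness=Harness(
        system_prompt="You are a careful coding agent.",
        rules=["Write good code", "Run the test suite before proposing a commit"],
        tools={"read_file"},
        context_docs=["README.md"],
    ),
)

print(config_issues(agent, required_tools={"read_file", "run_tests"}))
# ['missing tool: run_tests', "vague rule: 'Write good code'"]
```

Note that the `model` field never changes in this check. The point of the sketch is that the diagnosable problems sit in the harness, which is why swapping the model alone would not fix them.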

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
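
The routing lever above can be sketched as a small function that sends easy work to a cheap model and escalates only on risk signals. This is a toy under stated assumptions: the tier names, the per-token prices, the keyword list, and the three-file threshold are all made up for illustration, and a production router would more likely use task classification or past success rates than substring matching.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTier:
    name: str
    usd_per_1k_tokens: float  # made-up price, for illustration only


CHEAP = ModelTier("small-model", 0.001)
STRONG = ModelTier("large-model", 0.02)

RISKY_WORDS = ("migration", "auth", "concurrency", "security")


def route(task: str, files_touched: int) -> ModelTier:
    """Send easy work to the cheap tier; escalate on simple risk signals."""
    if files_touched > 3 or any(w in task.lower() for w in RISKY_WORDS):
        return STRONG
    return CHEAP


def batch_cost(tasks: list[tuple[str, int, int]]) -> float:
    """Total cost for a list of (description, files_touched, tokens) tasks."""
    return sum(route(d, f).usd_per_1k_tokens * t / 1000 for d, f, t in tasks)


tasks = [
    ("Rename a variable", 1, 2000),
    ("Fix typo in docs", 1, 1000),
    ("Rework auth token refresh", 2, 8000),
]

print(round(batch_cost(tasks), 3))                       # 0.163 with routing
print(round(STRONG.usd_per_1k_tokens * 11000 / 1000, 3))  # 0.22 if all on the strong tier
```

With these invented numbers the saving is modest because one hard task dominates the token count; the lever matters more when most of the workload is routine.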

85% of devs use AI coding agents (51% daily)

41% of all new code is AI-generated

~90% of agent behavior is the harness, not the model

+19% longer on some tasks (METR): verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

Why System Design and Context Engineering Trump Model Size

This new perspective shifts the focus from chasing the latest AI models to optimizing the surrounding system — the harness and context. Organizations that understand this can better control costs, improve reliability, and develop more robust AI applications. It also suggests that competitive advantage lies not in acquiring the newest models but in mastering configuration, tooling, and system architecture.

The Evolution of AI Development Practices and the Rise of Agentic Engineering

Since early 2026, AI-assisted development has seen rapid adoption: the whitepaper reports that 85% of developers use AI coding agents (51% of them daily) and that 41% of all new code is AI-generated. Historically, the focus was on model improvements, but recent research indicates that system configuration and context management have become more critical. The whitepaper builds on this shift, emphasizing that the ‘model’ is only a small part of the overall system, and that effective engineering now depends on how well the harness and context are engineered.

“The model accounts for only about 10% of the behavior; the harness and context engineering determine the rest.”
— Addy Osmani

Unresolved Questions About Implementation and Industry Adoption

While the whitepaper presents compelling evidence and a clear conceptual framework, it remains to be seen how quickly and broadly organizations will adopt this paradigm shift. Specific best practices for harness design, context management, and verification at scale are still evolving, and industry-wide standards have yet to solidify. Additionally, the long-term impact on AI model development and the role of model innovation in this new framework are still uncertain.

Next Steps for Organizations and AI Developers

Organizations should evaluate their current AI development practices, focusing on system architecture, harness design, and context engineering. Developing best practices, tools, and standards for configuration and verification will be crucial. Industry leaders are likely to invest more in system-level engineering and less in chasing ever-larger models, emphasizing cost-effective, reliable AI deployment. Further research and case studies are expected to refine these strategies over the coming months.

Key Questions

Why is the model only 10% of the system’s behavior?

The whitepaper argues that most of an AI system’s performance depends on what surrounds the model: its configuration, prompts, tools, and verification processes. Together, these account for about 90% of behavior.

What is ‘agentic engineering’?

Agentic engineering involves integrating AI into formal systems with structured specifications, automated tests, and verification, emphasizing configuration and context over raw model size.

How does this shift affect AI development costs?

Focusing on harness and context engineering can reduce long-term costs by improving reliability and reducing inefficiencies associated with vibe coding, despite higher upfront investment.
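
The trade-off in that answer is a break-even calculation: a higher upfront cost pays off once enough features have shipped at a lower per-feature cost. The sketch below works through it with invented numbers. The cost units, the 40-unit upfront investment, and the 3× per-feature ratio (the low end of the 3–10× range cited earlier) are assumptions for illustration, not figures from the whitepaper.

```python
import math


def total_cost(upfront: float, per_feature: float, features: int) -> float:
    """Cumulative cost after shipping a given number of features."""
    return upfront + per_feature * features


def break_even_features(upfront_a: float, per_feature_a: float,
                        upfront_v: float, per_feature_v: float) -> int:
    """Smallest feature count at which approach A is strictly cheaper than V.

    A is the high-upfront, low-running-cost approach (agentic engineering);
    V is the low-upfront, high-running-cost approach (vibe coding).
    """
    if per_feature_v <= per_feature_a:
        raise ValueError("no crossover: V is not more expensive per feature")
    n = (upfront_a - upfront_v) / (per_feature_v - per_feature_a)
    return max(0, math.floor(n) + 1)


# Hypothetical units: A costs 40 upfront and 2 per feature,
# V costs 0 upfront and 6 per feature (3x the per-feature cost of A).
n = break_even_features(40, 2, 0, 6)
print(n)                                             # 11
print(total_cost(40, 2, n), total_cost(0, 6, n))     # 62 66
```

At ten features the two approaches cost the same in this example (60 units each), and from the eleventh onward the upfront investment is ahead. A higher per-feature multiple moves the crossover earlier.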

Will this change how AI models are developed?

Model development will remain important, but the emphasis shifts toward system design, with harness and context engineering complementing model improvements to produce better outcomes.

What should organizations do now?

They should start assessing and improving their system architecture, develop expertise in context engineering, and prioritize verification and tooling to optimize AI performance.

Source: ThorstenMeyerAI.com
