📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent whitepaper emphasizes that in AI-assisted software development, the model itself is only 10% of the system. The focus should be on harness design and context engineering, which drive behavior and cost efficiency.

A new whitepaper from Google, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the model accounts for only about 10% of an AI-driven system’s behavior. This challenges common assumptions and highlights that the harness and context engineering are the primary drivers of performance and cost management in AI coding workflows.

The paper emphasizes that the dominant factor in AI system behavior is not the underlying model but the harness: the prompts, tools, rules, and observability layers surrounding the model. Experiments cited show that changing only the harness can dramatically improve performance, even when using the same model. For example, moving an agent from outside the Top 30 to Top 5 on a benchmark was achieved solely through harness adjustments.

Furthermore, the paper introduces the concept of context engineering as a core skill. It involves curating instructions, knowledge, memory, examples, and guardrails to optimize the model’s output. The authors argue that effective context management reduces costs and improves reliability, especially when loading only necessary information dynamically rather than constantly streaming static context.

From an economic perspective, the whitepaper warns that vibe coding—prompt-based, minimal oversight work—may seem cheap initially but incurs higher long-term costs due to token inefficiency, maintenance, and security vulnerabilities. Disciplined, agentic engineering approaches, while requiring higher upfront investment, result in lower marginal costs and more predictable outcomes.

At a glance
reportWhen: published March 2026
The developmentThe Google whitepaper argues that the key to effective AI coding is not the model but the harness and context, shifting the focus in SDLC strategies.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Implications for AI Development and SDLC Strategies

This shift in understanding underscores that organizations should prioritize harness design and context engineering over chasing the latest models. The findings suggest that the competitive advantage lies in how AI systems are configured, maintained, and secured, not just in model selection. For developers and leaders, this means rethinking resource allocation, emphasizing configuration, tooling, and process discipline to maximize value and control costs in AI projects.

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background on AI’s Evolving Role in Software Engineering

Since early 2026, AI coding agents have become widespread, with 85% of developers using them regularly. The industry has focused heavily on adopting new models, but this whitepaper challenges that focus, emphasizing that the real leverage comes from how these models are integrated and managed. Prior discussions have centered on model improvements, but recent experiments indicate that harness and context are more impactful in performance and cost-efficiency.

“The model is only 10% of the system; the rest is how you harness and guide it.”

— Addy Osmani

Observability in the AI-Native Era: Leveraging AIOps to build, observe, and operate resilient systems

Observability in the AI-Native Era: Leveraging AIOps to build, observe, and operate resilient systems

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unresolved Questions About Implementation and Scope

While the whitepaper provides strong evidence that harness and context are critical, it does not specify exact best practices for different AI applications or how these principles scale across diverse industries. The precise methods for measuring and optimizing harness components remain under discussion, and the long-term impact on AI development workflows is still being observed.

AI Context Engineering: Architecting Intelligence Through Prompt Structures, Tools, and Memory

AI Context Engineering: Architecting Intelligence Through Prompt Structures, Tools, and Memory

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for Developers and Organizations

Organizations should evaluate their current AI workflows, focusing on harness design and context management. Developing standardized tools, frameworks, and training around these areas will be essential. Further research and experimentation are expected to refine best practices, and industry leaders may begin to prioritize configuration over model upgrades in their AI strategies.

AI-Powered Web Design Mastery: Harness the Power of Framer AI to Build, Customize, and Launch Stunning Websites—A Step-by-Step Guide for Beginners and Professionals

AI-Powered Web Design Mastery: Harness the Power of Framer AI to Build, Customize, and Launch Stunning Websites—A Step-by-Step Guide for Beginners and Professionals

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system behavior?

The whitepaper shows that the behavior of AI systems is primarily determined by how the model is integrated, guided, and constrained through prompts, tools, and rules — collectively called the harness.

How does this change current AI development practices?

It shifts focus from constantly chasing new models to investing in harness design, context engineering, and configuration management to improve performance and reduce costs.

What are the economic implications of this insight?

While vibe coding appears cheap initially, disciplined harness and context engineering offer lower long-term costs, better security, and more predictable outcomes, making them a more cost-effective approach.

Will this approach work with all AI models?

The principles are broadly applicable, but specific strategies will vary depending on the model architecture and application domain. Ongoing research will clarify best practices.

What should organizations do now?

Focus on improving harness design, develop tools for context management, and train teams in disciplined AI configuration practices to maximize value and control costs.

Source: ThorstenMeyerAI.com

You May Also Like

Using AI to Update Old Content at Scale

With AI-driven updates at scale, you can efficiently refresh old content—discover how to unlock its full potential now.

The 90-Day Window Closed. Nobody Sent a Notice.

The 90-day window for responsible vulnerability disclosure has closed without any notices or patches from vendors, raising concerns about AI-driven exploits.

Singapore: Engineer the Transition

Singapore’s approach to economic and technological transition relies on calibrated, well-funded policies focusing on continuous reskilling and AI innovation.

Using Open-Source AI Writing Software Alternatives

Boost your writing flexibility with open-source AI alternatives—discover how customizing these tools can revolutionize your workflow and more.