Engineering performance
DORA, SPACE, METR, rework, and verification cost. These essays cover how AI-assisted engineering is measured, why old metrics mislead, and what to track once agents do the work. Browse all insights.
You Can’t Govern What You Can’t See: The Visibility Gap in Agentic Engineering
Claude Code pricing and Copilot billing reveal a deeper problem: leaders have little visibility into an emerging agentic workforce. Costs are a symptom; governance starts with visibility.
The Strongest AI Engineering Teams Won’t Be Built From Bigger Agents
A DevOps study found collaborative team structures beat both silos and full integration. Applied to AI agents, the missing variable is a governance layer — with the metrics to prove it.
Stanford AI Index 2026: AI Coding Is Becoming Solved. Engineering Governance Has Not.
The Stanford AI Index 2026 shows coding benchmarks near saturation and adoption mainstream. If generation is solved, the unsolved layer is keeping agent output aligned with architectural decisions.
The SPACE Framework: Measuring GitHub Copilot’s Real Productivity Impact
Most Copilot ROI reports stop at accepted suggestions. The SPACE Framework — from GitHub, Microsoft Research, and University of Victoria — reveals the governance gap hiding beneath the activity gains.
METR's AI Productivity Studies: Why AI Coding Feels Fast but Measures Slow
Two METR studies say almost the opposite thing about AI's impact on developer productivity. Reconciling them shows what data teams should actually measure — and where governance pays back.
Why AI Coding Productivity Gains Often Lead to More Rework
Faros AI's 2026 telemetry shows throughput up 33.7% — and bugs per developer up 54%, code churn up 861%. Much of that churn is rework correcting AI changes that violated system-level decisions.
The Verification Tax of AI Coding Agents: Why Faster Code Creates More Review Work
Glean finds workers reclaim 11 hours a week from AI and hand 6.4 back in botsitting. For engineering teams the bill lands as a verification tax — and the refund is executable architectural constraints, not more review.
GitHub Just Turned AI Code Review Into a Budget Line Item
GitHub Copilot moved to usage-based AI Credits on June 1, 2026, and code review now burns Actions minutes per PR. When review is metered, ungoverned output becomes a finance problem — governance economics.
The AI ROI Problem Is Not About Models. It Is About Systems.
Weak enterprise AI ROI is not evidence that AI fails to create value. It is evidence that organizations matured generation capability faster than the governance and verification infrastructure needed to operationalize it. Generation is commoditizing. Verification is not.
Deployment Quality Will Define the AI Era
The first AI era rewarded early adoption. The next rewards operational quality. KPMG research points to deployment quality as the new differentiator — and for engineering teams, that starts with governance.
AI Adoption Maturity Model: A Technical Analysis for Engineering Leaders
CMU SEI and Accenture's new maturity model defines five levels and eight dimensions, with Risk and Governance among them. The question the model leaves open is how governance decisions actually reach coding agents.
Datadog’s State of AI Engineering Report Quietly Confirms the Governance Crisis
1,000+ production orgs. 70% running 3+ models. One sentence buried in the data: “In practice, model churn becomes a governance problem.” Here is what the report actually says about where the industry is headed.