Engineering performance — DORA, SPACE, METR, rework, and verification cost. These essays cover how AI-assisted engineering is measured, why old metrics mislead, and what to track once agents do the work.

Insights / Topics

Engineering performance

DORA, SPACE, METR, rework, and verification cost. These essays cover how AI-assisted engineering is measured, why old metrics mislead, and what to track once agents do the work. Browse all insights.

Cornerstone

Engineering12 min read

DORA Metrics Are Necessary But Insufficient For Agentic Development

DORA measures delivery-system behavior and still matters, but it cannot see whether an autonomous engineering system stays architecturally coherent as autonomy rises. Governance metrics are the missing third layer.

Read insight

You Can’t Govern What You Can’t See: The Visibility Gap in Agentic Engineering

Claude Code pricing and Copilot billing reveal a deeper problem: leaders have little visibility into an emerging agentic workforce. Costs are a symptom; governance starts with visibility.

Read insight

Research 11 min read

The Strongest AI Engineering Teams Won’t Be Built From Bigger Agents

A DevOps study found collaborative team structures beat both silos and full integration. Applied to AI agents, the missing variable is a governance layer — with the metrics to prove it.

Read insight

Industry Analysis 9 min read

Stanford AI Index 2026: AI Coding Is Becoming Solved. Engineering Governance Has Not.

The Stanford AI Index 2026 shows coding benchmarks near saturation and adoption mainstream. If generation is solved, the unsolved layer is keeping agent output aligned with architectural decisions.

Read insight

Productivity 7 min read

The SPACE Framework: Measuring GitHub Copilot’s Real Productivity Impact

Most Copilot ROI reports stop at accepted suggestions. The SPACE Framework — from GitHub, Microsoft Research, and University of Victoria — reveals the governance gap hiding beneath the activity gains.

Read insight

Thought Leadership 7 min read

METR's AI Productivity Studies: Why AI Coding Feels Fast but Measures Slow

Two METR studies say almost the opposite thing about AI's impact on developer productivity. Reconciling them shows what data teams should actually measure — and where governance pays back.

Read insight

Market Context9 min read

Why AI Coding Productivity Gains Often Lead to More Rework

Faros AI's 2026 telemetry shows throughput up 33.7% — and bugs per developer up 54%, code churn up 861%. Much of that churn is rework correcting AI changes that violated system-level decisions.

Read insight

Analysis10 min read

The Verification Tax of AI Coding Agents: Why Faster Code Creates More Review Work

Glean finds workers reclaim 11 hours a week from AI and hand 6.4 back in botsitting. For engineering teams the bill lands as a verification tax — and the refund is executable architectural constraints, not more review.

Read insight

Analysis8 min read

GitHub Just Turned AI Code Review Into a Budget Line Item

GitHub Copilot moved to usage-based AI Credits on June 1, 2026, and code review now burns Actions minutes per PR. When review is metered, ungoverned output becomes a finance problem — governance economics.

Read insight

Economics 14 min read

The AI ROI Problem Is Not About Models. It Is About Systems.

Weak enterprise AI ROI is not evidence that AI fails to create value. It is evidence that organizations matured generation capability faster than the governance and verification infrastructure needed to operationalize it. Generation is commoditizing. Verification is not.

Read insight

Thought Leadership 7 min read

Deployment Quality Will Define the AI Era

The first AI era rewarded early adoption. The next rewards operational quality. KPMG research points to deployment quality as the new differentiator — and for engineering teams, that starts with governance.

Read insight

Market Context11 min read

AI Adoption Maturity Model: A Technical Analysis for Engineering Leaders

CMU SEI and Accenture's new maturity model defines five levels and eight dimensions, with Risk and Governance among them. The question the model leaves open is how governance decisions actually reach coding agents.

Read insight

Industry Analysis 8 min read

Datadog’s State of AI Engineering Report Quietly Confirms the Governance Crisis

1,000+ production orgs. 70% running 3+ models. One sentence buried in the data: “In practice, model churn becomes a governance problem.” Here is what the report actually says about where the industry is headed.

Read insight