How spec driven development and agents work together

00:00 | 18:16

This edition traces a single idea: agents only work when the work is framed. We show how parallel flow turns intent into reviewable diffs, how ACE/FCA shapes context into precise, testable chunks, and where humans keep taste and final judgment. The narrative moves from framing to execution to oversight: define the unit of work, run many in clean environments, measure outcomes by diffs, and reserve judgment for humans. Precision and parallelism beat heroics, and teams that formalize both will out-ship those chasing demos.

TL;DR

•	Parallel flow, not single-thread loops: how Ona orchestrates agent threads and routes diffs for human review
•	Scale agents with ACE/FCA: frame large codebases into coherent context chunks to cut hallucinations and boost precision
•	Where agents help and where humans judge: lessons from OpenAI's internal Codex adoption
•	Kingland × Ona on-demand: Real agent usage adoption patterns for regulated teams
•	Don't miss out: Upgrade to Core in September and get $100 credits
•	ICYMI: Gitpod Classic PAYG sunsets Oct 15, 2025

Ecosystem watch

Advanced Context Engineering for Coding Agents

A practical framework for scaling agent performance by structuring context windows. Covers the ACE/FCA approach: break down large codebases into framed context chunks that agents can consume without hallucinations, while maintaining semantic continuity. Useful if you're pushing agents past simple single-file edits.

Coding as the Epicenter of AI Progress

Argues that all major AI breakthroughs ultimately bottleneck at coding: evaluation, alignment, interpretability, and application flow through code artifacts. Reframes programming not as a niche use case, but as the testing ground for the entire AI research stack.

Dev Containers for AI Agents

Santiago (@svpino) shows how containerized dev environments make agent work safe, reproducible, and fast. The demo runs agents in parallel across isolated containers, routes results for review, and reconnects from VS Code or mobile without drift.

How OpenAI Uses Codex

Pulls back the curtain on how OpenAI itself adopted Codex internally: from refactoring legacy repos to automating repetitive dev tasks. The paper highlights where agents reduce friction (tests, migrations) and where humans still need to validate intent and taste.

Tau² Benchmark

Benchmarking work that shows smaller models can outperform larger ones when task framing and evaluation are optimized. For teams weighing cost/performance trade-offs in production, Tau² suggests investing in better task design and validation loops rather than chasing the biggest model.

The Future of Agent Collaboration

A community draft spec on how multiple coding agents should coordinate: shared memory, task arbitration, escalation triggers, and conflict resolution. It sketches what a "protocol layer" for agent teamwork might look like, beyond today's isolated single-agent workflows.

Ona Updates

In Search for Parallel Flow

A concrete look at why single-threaded "edit → run → fix" loops cap output. The piece shows how Ona splits work into parallel agent threads with explicit intents, then routes results back as diffs for human review. You'll see how automations and standardized dev containers let teams queue jobs, enforce audit trails, and keep latency low while throughput climbs.

ICYMI: Take advantage of Ona's September offer

Upgrade to Core in September and get $100 in credits. Ideal if you want to trial parallel flows (migrations, doc gen, multi-repo updates) under enterprise guardrails.

Kent Beck: "Genie Fight" (Tidy First)

A pragmatic argument that real leverage isn't typing speed; it's shaping problems and validating outcomes. Beck frames agents as force multipliers when you define tight contracts (inputs, invariants, tests) and judge them by diffs that pass, not demos that impress. Useful lens for leaders deciding where agents should own steps vs. where humans retain taste and final approval.

Changelog: Slash commands

Turn proven workflows into one-line, repeatable actions. Parameterize tasks like `/migrate`, `/doc`, `/bump`, or `/triage`, attach guardrails (IAM/RBAC, audit logs), and trigger multi-agent runs that return diffs for review. Version commands as your practices evolve so new teammates ship the same quality on day one.

ICYMI: Gitpod Classic PAYG sunsets on October 15

Gitpod Classic pay-as-you-go users must migrate to Ona by October 15th, 2025. This change does not apply to Enterprise customers. See our guide for migration tips and details.

ICYMI: On-demand Fireside chat: Kingland × Ona

In this fireside chat we go deep with Patrick to hear how Kingland performs 10x faster migrations and how they automate documentation on 15-year old legacy code. All whilst maintaining compliance as a CMMI Level 5 org. Get hands-on tactical insights on how to successfully adopt coding agents at scale.

Upcoming events

Platform Engineering Day @ KubeCon + CloudNativeCon NA — Atlanta, Nov 10, 2025

AWS re:Invent — Las Vegas, Dec 1–5, 2025

May your diffs be short and your PRs uneventful,

Your friends at Ona

We see specs as the key to making agents work effectively and predictably.

How spec driven development and agents work together

Ecosystem watch

Ona Updates

Upcoming events

Join 440K engineers getting biweekly insights on building AI organizations and practices