The Spec Growth Engine: Spec-Anchored, Code-Coupled, Drift-Enforced Architecture for AI-Assisted Software Development

Published 25 Jun 2026 in cs.SE and cs.AI | (2606.27045v1)

Abstract: AI coding agents dramatically accelerate implementation speed but introduce two structural failure modes that existing spec-driven approaches do not fully solve: (1) context explosion -- the agent must reason over an entire repository at once, degrading output quality as the context window fills; and (2) silent spec-code drift -- code evolves, the specification does not, and the divergence becomes invisible until it is costly to repair. We present the Spec Growth Engine, a lightweight framework that addresses both failure modes through a machine-readable spec graph whose nodes carry explicit contract/design separation, a Spine context assembler that scopes agent context to an ownership path, a vertical-slice growth protocol that enforces hardest-first ordering, and a drift gate that makes spec-code divergence a blocking merge condition. The design synthesises well-established software engineering principles (Parnas information hiding, C4, ADRs, Walking Skeleton, Reflexion Models, Fitness Functions) into a lean, code-coupled, machine-enforced whole -- without the overhead of heavy-weight frameworks such as RUP or MDA.

Abstract PDF Upgrade to Chat

Authors (1)

Hartwig Grabowski

Summary

The paper presents a framework that couples code and specs to prevent context explosion and silent spec-code drift through strict architectural contracts.
It introduces a Spec Graph and Context Assembler to deterministically scope agent context and reduce input size, improving metrics from 29% to 3%.
The work integrates classical architectural principles with AI coding practices to enforce machine-validated specifications and scalable system evolution.

Authoritative Summary of "The Spec Growth Engine: Spec-Anchored, Code-Coupled, Drift-Enforced Architecture for AI-Assisted Software Development" (2606.27045)

Problem Characterization

The proliferation of AI coding agents has revealed structural deficiencies in contemporary AI-assisted software development. Two principal failure modes dominate: context explosion and silent spec-code drift. Context explosion arises from indiscriminate context expansion—agents are forced to reason over entire repositories, degrading solution quality due to context window saturation, as empirically demonstrated by precipitous performance declines on long-context coding benchmarks. Silent spec-code drift manifests when code and specification diverge without detection; the spec ceases to reflect current system behavior, leading subsequent agent runs astray and compounding architectural debt.

Existing frameworks at the spectrum’s extremes—spec-first and spec-as-source—are inadequate. Spec-first approaches induce upfront overhead and risk mis-specified architectures, while spec-as-source systems introduce nondeterminism and an unaccepted single point of truth, often rejected by practitioners.

Spec Growth Engine Architecture

The Spec Growth Engine is positioned as a minimal, enforceable, spec-anchored yet code-coupled framework. Its architecture integrates classical software engineering principles (Parnas information hiding, ADRs, C4 model, Fitness Functions) through five interlocked components: the Spec Graph, Context Assembler, Drift Validator, Governance Gates, and Capability Registry.

Spec Graph

Each architectural entity is a node materialized as a SPEC.md file, stratified at C4 hierarchy levels: system → container → component → code. Every node distinguishes between contract (outward) and design (inward), implementing information hiding; contracts are accessible to neighboring nodes, while designs are restricted to owners and parents. Ownership edges define a tree structure (Spine), and dependency edges form an acyclic DAG, maintaining strict contract-only dependency visibility.

Context Assembler and Spine

Agent context is deterministically scoped via the Spine, comprising root invariants (from ARCHITECTURE.md), ownership path contracts, the node’s full spec, one-hop dependency contracts, and owned code. Notably, context excludes sibling details, dependency designs, code, transitive dependencies, and arbitrary search, guarding against context explosion. Cross-cutting invariants are maintained in ARCHITECTURE.md, prepended to every agent context bundle.

Vertical Slices and Growth Protocol

System evolution proceeds by vertical slices—end-to-end increments traversing architectural layers—ordered hardest-first. Layer-1 invariants are specified upfront (not just-in-time), encompassing architectural boundaries (persistence, security, external integrations). Features grow as vertical slices, with real implementations and restricted use of frontier stubs exclusively at future slice boundaries.

Drift Validator

Drift validation is central; it compares intent (SPEC.md) and evidence (code-derived) graphs. Alignment is machine-enforced: divergences (orphan code, undeclared dependencies, contract bypasses) are hard errors that block merges. The AI agent is mandated to update the spec in the same commit as the code, and humans review only contract-level changes. This approach negates hidden drift by structural enforcement, not process discipline.

Governance Gates and Capability Registry

Governance gates calibrate review authority to blast radius and reversibility. Changes affecting root invariants or public contracts require human approval (HARD); additive component boundaries are SOFT (asynchronous review); internal design and refactoring are AUTO (agent autonomy). The Capability Registry is a repository-wide reuse axis: capabilities are declared and shared, preventing duplication and enforcing architectural DRY principles.

Incremental, Minimal Growth Rules

Growth is rule-based and incremental: changes trigger specific rules, each prescribing the minimal coupling of artifacts (contract, acceptance criteria, code, tests, spec graph structure). This ensures contract preservation and restricts context width, preventing “greedy” context expansion.

Numerical Results and Claims

The paper provides compelling empirical support for context explosion, citing a sharp performance drop from 29% to 3% as the context window grows from 32K to 256K tokens in long-context coding benchmarks. The framework’s strict enforcement transforms drift from a process hazard to a structural guarantee, and context-scoping via the Spine yields significant reductions in agent input size, with practical implications for output quality. These claims are substantiated by references to robust research on context rot and LLM behavior on long inputs.

Practical and Theoretical Implications

Practically, the Spec Growth Engine offers a scalable, low-overhead solution for AI-driven codebases, mitigating brittleness by anchoring specifications and enforcing their evolution in lockstep with code. Its deterministic, derived index eliminates hand-maintained architectural artifacts, positioning the framework for rapid iteration without architectural erosion. Theoretically, the integration of established principles into an enforceable machine-readable protocol extends classical architecture documentation practices into the era of AI code generation and agent-based software development.

By preventing silent drift and bounding context, the engine circumvents the major failure modes of both traditional and AI-assisted development. The structural guarantees provided by blocking gates and deterministic derivations may catalyze broader adoption of spec-anchored, contract-first protocols in future AI engineering platforms.

Comparison with Contemporary Tools

In contrast to Kiro (spec-first, non-persistent specs), Spec Kit (attempted spec-anchored, excessive verbosity, relies on agent discipline), and Tessl (bidirectional sync, spec-as-source, inherits MDA nondeterminism), the Spec Growth Engine advances the spec-anchored, code-coupled middle ground. It maintains code as primary, with machine-verified alignment, and rejects both spec abandonment and generative spec-driven code, avoiding the fragilities documented in prior frameworks.

Discussion and Limitations

Overhead is confined to one SPEC.md per node and a single ARCHITECTURE.md per system—a marked reduction relative to traditional documentation. The framework relies on static dependency graph derivation; dynamic architectures with runtime binding or plugin systems require explicit annotations. Governance gates introduce operational overhead for fast-moving teams, partially mitigated by the SOFT gate. The engine’s approach aligns with core SWEBOK v4 Knowledge Areas (Architecture, Design, Testing, Quality, Management, Process).

Conclusion

The Spec Growth Engine constitutes a technically rigorous, enforceable framework for AI-assisted software engineering. It synthesizes classical architectural doctrines into a structure that precludes context explosion and spec-code drift, leveraging machine-readable artifacts, deterministic context scoping, contract-based dependency boundaries, and governance gates to ensure architectural integrity. This paradigm equips AI coding agents and human stakeholders with a scalable, contract-preserving methodology, laying a foundation for robust and evolvable AI-powered software systems (2606.27045).