The Harness Is the Control Surface Nobody Audits

§1The angle

Everyone audits the model. The model gets the red-teaming, the eval suite, the model card, the line in the procurement questionnaire. Almost nobody audits the harness: the runtime interface that decides what tools the agent can reach, what the agent sees of the world, how its outputs get parsed, and what gets fed back on the next turn. HarnessX (arXiv:2606.14249) is interesting precisely because it takes that harness seriously as an engineering object, treats it as compositional, and then makes it evolve from execution traces. That is the right instinct technically and a governance problem dressed up as a feature.

The substrate take: the harness is the actual control surface for an agent in production. It shapes behaviour more reliably than the weights do, because it sits between the model and every consequential action. HarnessX is the first paper we have seen this cycle that names the harness as a first-class artefact you can compose and improve. The moment you make it evolvable from traces, you have built a system that changes its own control surface in response to live data. For an FCA-regulated firm or a Local Authority, that is the bit the audit was supposed to pin down, now deliberately set in motion.

§2What HarnessX actually proposes

Three moving parts. First, compositional primitives: the harness is not a monolith but a set of building blocks (tool adapters, observation formatters, feedback channels) you assemble. Second, trace-driven evolution: the system reads execution traces, what the agent did, what worked, what failed, and proposes changes to the harness design itself. Third, a feedback loop that runs in both directions, improving the harness and informing model training off the same trace data.

The compositional bit is genuinely good. If your harness is a pile of building blocks rather than one undifferentiated lump of glue code, you can reason about each block, version each block, and swap one without rebuilding the whole interface. That is exactly the property you want for change control. Workloft has argued before that the runtime interface deserves the same versioning discipline as the model, and HarnessX gives that argument a concrete shape.

The evolution bit is where the trouble starts. Trace-driven evolution means the harness in production at 09:00 on Monday is not necessarily the harness in production at 17:00 on Friday. It learned something from the week's traces and changed. If you cannot say, for a given agent decision, which version of the harness was active and what it would have done differently, you have lost the chain that every audit framework assumes exists.

§3Why this lands hard on regulated buyers

Take the FCA. SS1/23 and the broader model risk expectations assume you can describe the system that produced a decision, hold it stable enough to test, and reproduce its behaviour for review. A harness that rewrites itself from last week's traces breaks the second and third of those by design. The model weights might be frozen and signed off; the behaviour still drifts, because the surface in front of the model moved. Your model risk sign-off now covers the wrong layer.

The ICO problem is sharper. UK GDPR Article 22 and the ICO's AI guidance (§11 on explainability and the producer-versus-reviewer separation) require that you can explain a decision affecting a data subject and that the logic is documented. "The harness evolved to format the observation differently, which changed what the model attended to" is not an explanation a data subject or a regulator will accept, and it is not one you can give if the evolution is not logged at the same fidelity as the decision itself.

For a Local Authority running an agent on housing allocation or benefits triage, FOIA and the public-law duty to give reasons apply to the decision the system reached. If the harness that shaped that decision is a self-modifying artefact with no version pinned to the case file, the authority cannot meet a reasons request and cannot defend the decision on judicial review. The evolvability that makes HarnessX attractive to a research lab is the exact property that makes it indefensible to a public body without heavy instrumentation around it.

§4The substrate fix HarnessX gestures at but does not build

The paper's own mechanism contains the answer, if you read it as an infrastructure builder rather than a researcher chasing capability. Trace-driven evolution requires traces. If you already have traces rich enough to evolve the harness, you have traces rich enough to audit the harness, provided you treat them as a permanent, tamper-evident record rather than a transient training input.

The discipline regulated buyers need on top of HarnessX is three things the paper does not provide. One: every harness version gets a content hash, and every agent decision logs the hash of the harness that produced it. That makes "which harness decided this" a one-line lookup instead of an archaeology project. Two: evolution is proposal, not deployment. The trace-driven loop suggests a new harness; a human or a separate policy gate approves it before it touches production, and the approval is logged. That preserves the change-control trail SS1/23 expects. Three: the producer-evolver and the auditor are different code paths owned by different people. The system that proposes harness changes must not be the system that signs them off, the same separation ICO §11 wants between the thing that makes decisions and the thing that reviews them.

None of this kills the capability. You still get a harness that improves from real traces. You just gate the deployment and pin the version, which costs you a few hours of latency on harness changes and buys you a defensible audit chain. For a regulated buyer that trade is not close.

§5What the paper does not solve

HarnessX is a capability paper, not a governance one, and it does not pretend otherwise. It gives no version-pinning scheme, no tamper-evidence on the traces it learns from, and no separation between the evolver and any reviewer. The traces are treated as a training resource, which is the opposite of treating them as evidence: training data gets consumed and discarded; evidence gets retained and protected. The two needs pull in opposite directions and the paper only serves one.

It also says nothing about adversarial trace poisoning. If the harness evolves from execution traces, an attacker who can shape those traces can steer the harness, and the harness is the control surface for every downstream action. That is a real attack on the substrate, in NCSC terms a supply-chain and integrity problem, and the paper offers no defence because defence was not its question. We do not fault the authors for that; we fault anyone who deploys this in a regulated setting without building the version pinning, the deployment gate, and the trace integrity that the paper leaves as an exercise for the reader. The harness is the control surface. Treat it like one.

Methodology note. This Note takes HarnessX (arXiv:2606.14249) as a capability paper with a governance shadow. Triggers: substrate-relevant (the harness is the runtime control surface, not the model, and this paper makes it evolvable); non-duplicative (we have covered runtime interfaces but not self-modifying harnesses or trace-as-evidence tension); regulated-buyer link (FCA model risk under SS1/23, ICO explainability §11, Local Authority duty to give reasons under FOIA and public law). The Workloft angle: traces rich enough to evolve a harness are rich enough to audit it, if retained as evidence not training data. Forthcoming: a reference pattern for harness version-pinning and deployment gates for regulated agent runtimes.