The Fleet Reaches Home

§1The reach runs the other way

When people put a home machine and a cloud server on the same private network, they almost always picture one direction of travel: sit at the home machine, reach out to the powerful box in the cloud. That is the obvious win and it is real. It is also the smaller half of what just became possible for us.

The bigger half is the reverse. Our agents do not live on a laptop at home. They live on a server, and that server can now reach into hardware we own. A private mesh network does not just carry us out to the cloud. It lets the cloud fleet quietly use a machine sitting on our own desk as if it were one more node in the rack. Once you see it that way, an ordinary owned computer stops being something you log into and becomes something the fleet can put to work.

§2What we wired, plainly

We took a modest machine we own, one with a capable onboard GPU, and stood up a small open-weight model on it. We then slotted that machine into our model router as the first choice for one specific kind of work: anything tagged private. Our router already had a sovereign tier, a rule that says PII-sensitive jobs must run on a local model and never touch a third-party API. Until now that tier ran on the server's own processor, which is slow because the server has no GPU.

The owned machine changed the economics of that tier without changing the guarantee. The same private jobs now run on its GPU, several times faster, and the heavy lifting happens off the server entirely, so the server stays lean for the live agents. And we kept the old path as a safety net: if the owned machine is asleep or the mesh drops, the router notices within seconds and falls back to the server's slower local model on its own. No human in the loop, no failed job. Fast and owned when it can be, slow and owned when it must be, never reaching for a cloud API with private data.

§3The honest ceiling

Here is the part a louder version of this story would skip. The machine is small. It can comfortably run a seven-billion-parameter model and no more. That makes it a genuinely useful engine for the dull, high-volume work, classifying a record, pulling fields out of a document, triaging a queue, and a poor one for hard reasoning. We are not claiming we run a frontier model in a cupboard. We run a small, fast, private workhorse for the tasks that suit it, and we route the hard thinking to where the hard thinking belongs.

That distinction matters because sovereignty is the easiest thing in this field to oversell. "Runs locally" is doing a lot of work in most pitches. The honest claim is narrower and more defensible: the private, high-volume pre-processing happens on hardware we own and control, and only derived or non-sensitive work ever leaves it. Said that way, it is a promise we can actually keep.

§4Why owning the box is the point

The reason to care is not cost. Cheap cloud APIs are cheaper per token than almost anything you can run at home. The reason is control. Running a capable model on hardware you physically own is the one thing an API model structurally cannot offer, because the data never leaves your possession to begin with. For a regulated buyer, a council, a firm under a data-residency obligation, that is not a nice-to-have. It is the difference between a conversation that can proceed and one that cannot. An owned node turns "your data stays on hardware you control" from a line on a slide into something you can point at.

§5Where this goes

One owned machine is a proof, not a platform. The interesting future is the shape it implies. A fleet does not need its private tier to live in one place. It can spread across several owned edge nodes, a machine in an office, one at home, one on a client's own premises, each joined to the same private mesh, each holding the sensitive part of a workload close to where the data actually sits. The cloud orchestrates and does the heavy reasoning. The owned edge holds and pre-processes the private material and answers fast.

That is a credible answer to the question every serious buyer eventually asks, which is not "how clever is your model" but "where does our data go". Increasingly the strongest answer is: through hardware you or we own, on a network nobody else is on, and only the safe residue ever reaches a cloud. The work this week was small. The direction it points is not. The cheapest, most defensible node in your stack may turn out to be a machine you already own, sitting quietly on a desk, waiting for the fleet to ask it for something.

Methodology note. This Note is a first-person report on wiring an owned machine into our model router as the primary sovereign-inference tier, with automatic fallback to the server's local model when the owned node is unreachable. No client or personal data was involved in the work. Performance characterisations (faster than the server's CPU tier; a seven-billion-parameter ceiling) are our own measurements on our own hardware.

§1The reach runs the other way

§2What we wired, plainly

§3The honest ceiling

§4Why owning the box is the point

§5Where this goes

▸ Related