▸ NOW LOADING WORKLOFT.LABS ◂

WORKLOFT LABS

Substrate before spectacle.

The research arm of Workloft. We track the AI frontier daily — 70+ papers a week, scored by Walt against nine substrate axes, citation-graphed via Semantic Scholar — and publish what to actually build for governed agent infrastructure.

▸ LABS API + MCP — get a free key All sections →

1623

Papers screened

Above threshold

Research axes

Notes published

        
        walt@workloft:~/arxiv-watch ▸ daily run · 2026-05-27
      
$ ./run.sh --since 30h --score
  ▸ pulled 71 unique papers across cs.AI + cs.LG + stat.ML
  ▸ HuggingFace Daily Papers index: 24 community-curated entries
  ▸ scored against 9 Workloft research axes (Gemini 2.5 Flash)
  ▸ enriching top 5 with Semantic Scholar citation graph...
  ✓ 5 papers cleared the 7/10 threshold
  ✓ digest dispatched · email + telegram + labs.html
$ _

▸ TODAY'S LAB BENCH

Five test tubes. Five papers. Top arXiv picks for today, scored against the Workloft research axes. Live from Walt's pipeline. Hover a tube. Click for the abstract.

▸ pulling today's picks from labs-api…

REG FIT = "would this clear an FCA Risk review, a UK GDPR DPIA, or a Local Authority procurement audit?"
●●● strong fit · ●●○ moderate · ●○○ low / academic-only.

▸ WORKLOFT LABS · OUTPUT · 5 PUBLISHED · NEW EVERY ~3 DAYS

WORKLOFT PAPERS

Substrate before spectacle.

Long-form research essays from the lab. One paper, one regulated lens, ~1,000 words each. Strong opinions, weakly held. Framed for FCA-regulated firms, UK Local Authorities and NHS Trusts — the buyers who have to defend the architecture, not the demo.

▸ THIS WEEK'S MUST-READ · PAPER №14

Trajectories Write Tests

PhoneWorld's architectural point: real usage yields both controllable environments and auto-generated verifiers. Let production usage write the test suite as a side effect.

~1,650 WORDS 8 MIN READ AGENTINFRASTRUCTURE · LLMEVALUATION · REGULATEDAI READ PAPER №14 →

All 15 papers → Read via API → Submit a runtime →

▸ THE LAB

Pick a wing. Each one opens to its own page — methodology, ledger, problems, the pipe, the watch-list.

9×

The 9 Axes

The rubric Walt scores every paper on. Published openly because rubrics that don't survive sunlight aren't rubrics.

Methodology→

✓

Replication Ledger

Papers we've cited and actually rebuilt. What worked, what didn't, what the authors didn't write down.

Proof of work→