Brief #145
Multi-agent systems are failing in production not from model capability gaps, but from context architecture choices. Practitioners are discovering that intelligence compounds only when context persists across agent boundaries—and current frameworks reset rather than preserve.
Small Models Match Frontier via Context Architecture
EXTENDS model-selection-strategy4B parameter models achieve Sonnet 4.6 performance when fine-tuned for recursive self-delegation. Context utilization patterns matter more than parameter count.
4B model matching Sonnet 4.6 through recursive architecture—proves context flow architecture beats scale
Smaller model outperformed expectations because Letta Code's context management preserved state across turns
Multi-tier context architecture (primacy/recency slots, compression loops) enables intelligence compounding independent of model size
Distributed State Management Is Agent Deployment Blocker
95% of agent projects fail to reach production because context breaks during rebuilds. Teams building complex systems report AI struggles with state consistency across multiple components.
Practitioner reports AI fails at distributed state management—creates manual cleanup work in production systems
MCP Servers Enable Context Federation at Scale
Model Context Protocol creates persistent context connectors that compound intelligence across sessions rather than rebuilding access each time. Standardized interfaces unlock context portability.
MCP standardizes context access across diverse tools—thin wrapper servers compose in context window for federation
Agent Task Batching Reduces Context Fragmentation
Grouping related work items before agent processing preserves inter-item context and reduces decision overhead. Sequential isolation causes intelligence loss.
Batching 20 PRs into staging branch before agent testing—reduces from 20 isolated reviews to 1 holistic integration test
SDK Control Beats CLI for Production Agents
Sophisticated practitioners building production agents prefer programmatic SDK control over opinionated CLI tools. Long-running agent loops require custom context management layers.
Developer abandoned Claude Code CLI for SDK-based Codex to build long-running automations with full execution control
Cargo-Cult Prompting Obscures Context Specification
Teams treat prompting as mysterious incantation rather than structured information specification. Vague requests produce brittle, non-reproducible results.
Practitioner pushes back on slash-command worship—structured requests outperform magical keywords
Visual Context Bridges Enable Computer Use
Agents that perceive and interact with UI states maintain continuous problem understanding across tool transitions. Screen perception unlocks automation domains without API integration.
Claude maintained task context across opening Mac apps and retrieving token—computer use preserved problem statement across boundaries
Daily intelligence brief
Get these patterns in your inbox every morning — plus MCP access to query the concept graph directly.
Subscribe free →