← All concepts

safety constraints

20 articles · 15 co-occurring · 0 contradictions · 48 briefs

making sure the model can't even express an action it's not allowed to take" — Article directly advocates for architectural constraints that make illegal actions impossible at the model output level,

2026-W22
19
2026-W21
127
2026-W20
121
2026-W19
82
2026-W18
102
2026-W17
92
2026-W16
77
2026-W15
73

We got tired of browser frameworks restricting the LLM. So we removed the framework." — Directly addresses framework constraints as the core problem solved

making sure the model can't even express an action it's not allowed to take" — Article directly advocates for architectural constraints that make illegal actions impossible at the model output level,

Transactional No-Regression (TNR), which enables safe exploration and iteration" — TNR introduces a novel safety specification formalism for agentic systems, extending safety constraint frameworks wit

after converting a large portion of the codebase to strict types and fail fast, codex actually starts to pick up what we are doing here" — Demonstrates that strict type systems improve AI agent compre

engineering to "Harness Engineering"—the practice of building deterministic safety nets around non-deterministic models" — Article explicitly introduces 'Harness Engineering' as a practice of wrapping

High time people read "The Goal", it's not a fun read, but it's an important read specifically on bottlenecks" — Direct reference to Goldratt's Theory of Constraints classic text as foundational to un

committed to leveraging AI in a responsible, effective, ethical, and safe manner" — Windreich Department explicitly prioritizes safe and ethical AI deployment in clinical settings, directly supporting

If enough builders share even a slice of their traces publicly, we can create the largest crowdsourced open dataset for agents." — The article advocates for and demonstrates a crowdsourcing strategy t

An ESM defines organization's data, processes, and policies" — Article extends policy concepts by showing how semantic models encode organizational policies to guide AI decision-making

how to write Goals that give Codex a clear outcome, constraints and verification criteria" — Article demonstrates how Goals in Codex specify outcome, constraints and verification—a concrete implementa

The findings yield several unexpected insights, which are discussed in detail in the paper, offering a deeper understanding of how context affects the reliability and effectiveness of LLM-assisted alg

we only found two really clear ones -- a "current thing" slot and a "previous thing" slot" — Empirical evidence that entity representation capacity is severely limited (binary slot count), suggesting

i'd hit them once a week if i was lucky" — Quantified user observation of constraint frequency differences between models in active use

[INFERRED] "claude code subs only work if you use our harness that honors feature flags and honors caching properly" — Illustrates a novel constraint pattern in headless SaaS: platform harnesses must

[EXPLICIT] "What do they need to know about coding practices in order to be more effective" — Article directly asks what coding practices non-programmers must understand to be effective with AI assist

[DIRECT] "So far I've received: 3x Tea bags, 2x Japanese KitKats, 2x Hot sauces, 2x Glass ducks, 3x Random Candies" — Concrete evidence of repetitive selection pattern showing Claude's constraint in g

[INFERRED] "those bottlenecks focus the efforts of AI labs leading to breakthroughs that unlock new areas of work" — Article demonstrates how capability bottlenecks paradoxically drive focused researc

[inferred] "Apple Container is an interesting initiative... provide constrained environments for agents to run in" — Apple Container represents a practical implementation of constrained execution envi

[INFERRED] "people believing AI agents are capable of arbitrarily solving the problems they know how to prompt and verify" — Article identifies a specific cognitive bias in how developers assess agent

[inferred] "the latter will be more powerful" — Author asserts that reducing implicit knowledge is more powerful than attempting to capture it, prioritizing simplification

query this concept
$ db.articles("safety-constraints")
$ db.cooccurrence("safety-constraints")
$ db.contradictions("safety-constraints")