agent behavior patterns
21 articles · 15 co-occurring · 2 contradictions · 6 briefs
"37 hidden reactive messages that nudge the agent mid-conversation. Here's how they work" — Concrete example of reactive messaging as a mechanism for continuous behavior guidance and course-correction du
[INFERRED] "How long an agent runs is an intermediate metric to task complexity" — Article challenges using agent runtime duration as a proxy for task complexity; same complexity achieved in less time is preferable
[STRONG] "struggle with nuance, intent, and knowing when to ask clarifying questions" — Article explicitly identifies core limitations of AI agents in handling subjective judgment and contextual understanding, directly challenging assumptions about agent omniscience.
"Claude's answers can be right even when its explanations or self-reports are inaccurate or made up after the fact" — Provides empirical evidence of a key model behavior: correct outputs decoupled from
"I strongly believe the future of enterprise agents looks less like strict workflows, and more like powerful autonomous Agents with embedded runtime validation" — Article envisions evolution toward aut
"Our homegrown agentic loop had key challenges — it required overly rigid instructions and struggled with complex, multistep edits" — Production evidence that simple loop-based agent patterns fail unde
"Agents: Built-in agent patterns." — LangGraph documentation explicitly describes Agents as built-in patterns available as prebuilt components.
"A simple chatbot takes your question, replies, and stops. But more advanced AI systems, often called agents" — Article distinguishes agent behavior from simple chatbots, highlighting agents' capabilit
"Gemini ends every turn with that annoying 'If you want to learn more about how X does Y, just say the word!'" — Article demonstrates instruction-following behavior in practice: Gemini consistently exec
[INFERRED] "70-turn conversation between our agents" — Extended agent interaction (70 turns) demonstrates agents operating autonomously in conversation with each other, showing capacity for sustained
[INFERRED] "we didn't ask them to do this" — Shows agents exhibiting self-directed behavior and initiative beyond their initial programming, taking autonomous action to improve platform quality.
[INFERRED] "have you ever considered that your agent is lying to you" — Article raises the critical need to verify agent claims against actual executed behaviors, extending conversation about agent ex
"Browserbase lets Claude Code test your app the way a user would." — Claude Code with Browserbase simulates real user behavior during QA testing, moving beyond script-based testing to genuine interacti
"Pi is basically Vim for agentic coding" — Directly positions Pi as a tool purpose-built for agentic coding workflows, analogous to Vim's role in traditional coding.
[INFERRED] "you claim to want to watch the film, but what you really want is the idea of watching the film. you want to occupy the subject-position of "person watching movie with friends" without actu
[INFERRED] "AI agents are starting to buy products and services, changing how companies must handle pricing and sales" — Article demonstrates a practical example of AI agents making autonomous purchas
[INFERRED] "almost like bringing in a PM at the right time" — Claude demonstrates meta-cognitive awareness by recognizing when to shift strategies (exploration → planning), analogous to human project
[INFERRED] "comprehensive tutorial to go through LangGraph for the creation of complex multi-agent AI applications" — Tutorial format provides educational evidence for building agent-based AI applicat
[INFERRED] "Hoard things you know how to do simonwillison.net/guides/agentic" — Article promotes practical career and engineering guidance specifically for working with coding agents, supporting the p
[INFERRED] "the task you're giving it is part of a test and that it will be evaluated on quality of solution and efficiency of action" — The observation reveals that agent behavior responds to evaluat
"tool thrash to refer to when an agent doesn't have the tools it needs to solve the problem" — Identifies a specific constraint-driven agent behavior: when tools are inadequate, agents enter repetitive