rag retrieval strategies

38 articles · 15 co-occurring · 3 contradictions · 0 briefs

Retrieved documents (Layer 3) are positioned as a distinct context layer; this implies RAG as a context engineering problem, not just a search problem.

Related concepts

multi agent orchestration 27 context window management 25 tool integration patterns 17 multi turn conversation management 11 context window optimization 10 state management 9 prompt architecture 9 prompt engineering 8 model selection strategy 6 retrieval augmented generation 5 agent orchestration 5 memory persistence 4 tool use in agents 2 performance optimization 2 context preservation across sessions 2

Contradictions

Multi-Agent AI Systems: Architecture & Failure Modes | Augment Code

Author notes 'A fact store cannot detect [alignment drift]; the facts stay correct while the trajectory goes wrong.' Suggests RAG retrieval alone is insufficient for maintaining agent alignment—need goal spec, not just facts.

Claude Code Q1 2026 Update Roundup: Every Feature That Actually Matters | MindStudio

AutoDream inverts typical RAG: instead of retrieving relevant context post-hoc, it pre-generates structured context. Different approach to same problem (context availability).

@shao__meng: 不！它严重低估了实际工程复杂度。

Author argues against filesystem abstractions (like AGENTS.md as 'memory') and for direct database access with SQL. This contradicts simplified RAG patterns and suggests retrieval should be handled by the system layer, not the harness.

Evidence chain (38 articles, showing 38)

Complete LangGraph Tutorial Beginner To Advance 2026 | RAG-Full-Course - YouTube example_of

RAG integration is mentioned as core topic; RAG is fundamentally about context retrieval and prioritization

Context Engineering for AI Agents (2025): The Complete Guide | Prompt Builder | Prompt Builder example_of

Retrieved documents (Layer 3) are positioned as a distinct context layer; this implies RAG as a context engineering problem, not just a search problem.

Context Engineering vs Prompt Engineering | DataHub example_of

RAG is a specific implementation of context engineering—managing what knowledge the model accesses at decision time.

Artificial Intelligence & Deep Learning | Agentic Context Engineering (ACE): Self-Improving LLMs via Evolving Contexts, Not Fine-Tuning | Facebook supports

The Reflector role (evaluating what context stays) aligns with relevance scoring in RAG systems, but at the composition level rather than retrieval level.

Why Legal AI Hallucinations Are Three Different Problems, And Most Tools Only Catch One example_of

The failure mode 'using wrong or irrelevant cases' is a RAG failure—retrieving documents that don't actually answer the query or contradict other retrieved documents. The solution requires smarter ret

@dhasandev: harness = context manager on behalf of the model example_of

Offloading and compaction strategies are essentially RAG-like retrieval patterns applied within agent context management.

context-engineering/README.md at main · bonigarcia/context-engineering · GitHub supports

External knowledge as a context source directly relates to RAG patterns and retrieval strategy design

Replit — Model Context Protocol (MCP): A Comprehensive Guide supports

MCP servers can provide the retrieval layer for RAG systems. The protocol standardizes how to surface retrieved context to AI models.

@elithrar: This was an idea that came out of left field as we were building Artifacts. example_of

ArtifactFS is a specialized RAG pattern: retrieve critical path (file tree), fetch detailed content on-demand. Priority-based retrieval.

@dhasandev: The business workflow is also a context window. supports

Compaction and offloading strategies map to retrieval/compression patterns in RAG systems

The Pulse: token spend breaks budgets – what next? example_of

Better token management often means better retrieval strategies (RAG) to avoid redundant context. Cost constraints incentivize efficient information retrieval rather than dumping all context into ever

The AI engineering stack we built internally supports

AGENTS.md generation and retrieval mimics RAG pattern—compress and index context (repo guidance) for efficient agent access.

State of AI Agent Memory 2026 - Mem0 extends

LlamaIndex integration with document-heavy RAG pipelines suggests memory layer must support semantic + structured retrieval—moving beyond pure semantic search.

I Spent Three Days at AI Engineer Europe. Here's What Actually Matters supports

Distributing context through shared repos and registries, evaluating context quality, observing how agents use it—these align with RAG system patterns and retrieval evaluation practices.

Graph of Agents: Principled Long Context Modeling by Emergent Multi-Agent Collaboration | OpenReview supports

Demonstrates 5.7% improvement on retrieval-augmented generation through better context collaboration—shows RAG can be improved via orchestration, not just retrieval quality.

Context Engineering: A Methodology for Structured Human-AI Collaboration supports

Context package composition and type libraries are methodologically related to what information to retrieve and how to structure it for AI consumption.

What is MCP (Model Context Protocol)? A Developer's Guide – Encore supports

MCP servers can expose retrieval capabilities (database access, file system, APIs). This is one mechanism for implementing RAG-like patterns.

@badlogicgames: Call out now also part of pi's docs. Just do it. example_of

Agent traces could become a new retrieval source: 'retrieve similar agent interactions' to improve decision-making. This is RAG applied to behavioral patterns.

@testingcatalog: OPENAI 🔥: Codex on macOS now supports Appshots, allowing users to quickly ad... extends

This is RAG applied to the user's current environment rather than a knowledge base. It's retrieval optimized for 'what's currently relevant to the user' vs 'what matches a query.'

@zeeg: if you want to come work on things like this i have open recs at Sentry extends

The pattern of integrating multiple system sources into agent context is similar to RAG architecture—pulling relevant context from multiple sources. The difference is real-time system access rather th

ICLR 2026: ACE Framework Boosts LLM Performance | Andriy Burkov posted on the topic | LinkedIn example_of

Akash Dolas comment specifically mentions 'lost in the middle' phenomenon comparison, suggesting ACE as alternative/complement to RAG approaches

Multi-Agent in Production in 2026: What Actually Survived | by Micheal Lanham | Apr, 2026 | Medium example_of

Shared artifacts pattern resembles RAG external memory. Bounding what agents see prevents redundant retrieval and re-processing.

@GaryMarcus: this is … odd. supports

Marcus's implicit solution (code reviews, oversight, roadmaps) is essentially RAG: retrieval of organizational context (design docs, prior decisions, scope) to augment code generation prompts. Without

@shao__meng: 还记得前段时间 Google 某位总监吐槽：我们内部做了一年的事，Claude Code 几个小时就干完了！ example_of

Google's failure to distribute knowledge about Claude Code capabilities is a failure of organizational RAG—they have no system to retrieve and surface external context (best practices, competitor benc

How AI Agents Actually Remember Things | by Dylan Oh | Apr, 2026 | Level Up Coding supports

Sub-agent summarization mirrors RAG's summarization step; both are compression mechanisms for controlling context

@shao__meng: Apify 开源了一套专门面向网络抓取、数据提取和自动化的 Skills @apify extends

The pattern of connecting structured data extractors to agents resembles RAG—returning relevant external information to augment the agent's context. Apify pre-structures the data, which mirrors RAG ch

@shao__meng: **Kimi 发布了浏览器扩展 ~ Kimi Web Bridgkimi.com/features/webbr…D8vy extends

Live web context retrieval (scrolling, clicking to find information) is a real-time alternative to batch RAG retrieval

Multi-Agent Orchestration with n8n in 2025: From Concept to Practical AI Systems | by Angelo Sorte | Medium example_of

The 'knowledge retrieval' step in the reference pipeline is a form of RAG; shows retrieval as one node in a multi-agent pipeline.

Practical Multi AI Agents and Advanced Use Cases with crewAI - DeepLearning.AI extends

Integration with external systems and tools implies retrieval patterns, though not explicitly discussed as RAG architecture choice.

@JeffBohren: I have been doing software development for nearly forty years and there is on... extends

This implies that coding AI systems need excellent RAG for architectural information—being able to retrieve relevant dependent files without hallucinating scope.

@RickLamers: Awesome initiative by @badlogicgames and @huggingface! extends

Trace datasets function as retrieval-augmented training data for agents; the sanitization step mirrors data quality concerns in RAG pipelines.

@jonas: Yes - but also just in the responses api in general supports

If you can set deadline/cost constraints, your retrieval strategy must adapt—expensive semantic search vs cheap lexical matching depending on constraint.

Multi-Agent AI Systems: Architecture & Failure Modes | Augment Code contradicts

@akseljoonas: Introducing ml-intern, the agent that just automated the post-training team @... example_of

Agent retrieves papers and datasets intelligently but no detail on retrieval ranking, context window management for large document sets, or citation graph traversal strategy

The Future of MCP: Roadmap, Enhancements, and What's Next supports

Remote MCP servers and observability tooling could enable better RAG architectures, though the article doesn't explicitly discuss this application.

Claude Code Q1 2026 Update Roundup: Every Feature That Actually Matters | MindStudio contradicts

AutoDream inverts typical RAG: instead of retrieving relevant context post-hoc, it pre-generates structured context. Different approach to same problem (context availability).

@shao__meng: 不！它严重低估了实际工程复杂度。 contradicts

Benchmarking AI Agent Frameworks in 2026: AutoAgents (Rust) vs LangChain, LangGraph, LlamaIndex, PydanticAI, and more - DEV Community supports

Memory-efficient frameworks enable more aggressive RAG caching and retrieval strategies since you have more budget for in-memory state.

query this concept

$ db.articles("rag-retrieval-strategies")

$ db.cooccurrence("rag-retrieval-strategies")

$ db.contradictions("rag-retrieval-strategies")