← All concepts

retrieval augmented generation

217 articles · 15 co-occurring · 10 contradictions · 13 briefs

It's RAG on steroids... ADAPTIVE RETRIEVAL: The AI literally asks itself 'Do I need to Google this, or do I already know?'" — The article explicitly positions Deep Research as an evolution of RAG, add

Context Engineering : Critical Shift from Prompting to Engineering

Article positions RAG as outdated (2024-2025) and superseded by Vector Orchestration. This is contrarian to current market reality where RAG remains dominant.

NeurIPS Poster Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks

Related but distinct: RAG retrieves external documents; this retrieves the agent's own successful behaviors. Different retrieval source and purpose.

RAG Is Dead. Long Live Context Engineering for LLM Systems

[strong] "But lately, in real-world projects, I find myself proposing something much simpler far more often: Pure LLM API implementation with structured context injection. No full RAG stack." — Article directly challenges RAG as the default production pattern, proposing simpler alternatives instead.

@shao__meng: @arlanr 指出,AI 写代码时出现幻觉,根源不在于模型能力不足,而在于训练数据滞后——API 每天都在变更,而模型的知识停留在数月甚至数年前。

[strong] "Retrieval gives you fragments, but agents need the whole picture." — Article explicitly challenges RAG as insufficient for agents because fragmentation loses context across pages and function signatures

@sachinrekhi: It's pretty fascinating how simple ChatGPT's memory approach is.

[STRONG] "Spoiler alert: There is no RAG used" — Article directly contradicts RAG-based approaches by asserting ChatGPT's memory does not use RAG, challenging common assumptions

@Hesamation: it's funny how everyone is losing their mind on LinkedIn over RAG, everyday t...

[CLAIMED] "everyday talking about a new one: Graph RAG, Agentic RAG, Naive RAG as if it's the foundation of AI. yet I have not seen any project that uses RAG in practice." — Author claims widespread LinkedIn hype about RAG variants but observes lack of real-world adoption, contradicting the presented importance of RAG as foundational.

@Hesamation: why do they even bother with the benchmarking when it doesn't mean anything?

[INFERRED] "why do they even bother with the benchmarking when it doesn't mean anything?" — Social media critique arguing benchmarks lack meaningful predictive value; challenges the utility of benchmark-driven evaluation

@jeffreyhuber: doing well on AI benchmarks is like having a great one rep max - impressive b...

[INFERRED] "doing well on AI benchmarks is like having a great one rep max - impressive but wholly unrelated to daily utility" — Article challenges the assumption that benchmark performance correlates with practical utility, arguing they measure different things entirely

Prompt Engineering Is Dead, and Context Engineering Is Already Obsolete: Why the Future Is Automated Workflow Architecture with LLMs - Prompting - OpenAI Developer Community

[STRONG] "Even the most advanced retrieval systems struggle when each step of a workflow needs different context, formatted in different ways, and tied to evolving business rules" — Article identifies fundamental limitation of retrieval systems when applied to complex, multi-step workflows requiring context reformatting and dynamic business logic

@simonw: Short musings on "cognitive debt" - I'm seeing this in my own work, where exc...

[INFERRED] "AI-generated code" — Article highlights unintended negative consequence of AI code generation (cognitive debt), contradicting narrative of pure productivity gains without oversight costs.

2026-W15
969
2026-W14
455
2026-W12
2

It's RAG on steroids... ADAPTIVE RETRIEVAL: The AI literally asks itself 'Do I need to Google this, or do I already know?'" — The article explicitly positions Deep Research as an evolution of RAG, add

Effective context engineering here relies on retrieval-augmented techniques like using vector search, RAG, or other filtering methods to inject only the most relevant facts, tools for the current task

[direct] "Wrong documents were retrieved. Right documents were never ingested. Conflicting snippets from different time ranges were shown together." — Article adds nuance to RAG by identifying specifi

This article directly addresses which retrieval strategy works best in RAG pipelines—a core RAG implementation concern

Article explicitly names RAG as a primary technique for handling large knowledge bases within context constraints

Parent-document retrieval is a specific RAG implementation pattern optimizing the retrieval component

Identified as core mechanism for enriching agent context with external knowledge. Shows how context preservation requires explicit retrieval integration.

Article explicitly lists RAG as a context engineering technique for shaping model responses

RAG: inject retrieved documents based on relevance to user intent" — Article explicitly lists RAG as a context engineering design pattern with practical demonstration in the sci-fi film example.

But lately, in real-world projects, I find myself proposing something much simpler far more often: Pure LLM API implementation with structured context injection. No full RAG stack." — Article directly

retrieval-augmented generation (RAG)" — Survey identifies RAG as a key architectural implementation of Context Engineering, integrating context retrieval and processing components.

the discipline of designing, optimizing, and orchestrating what goes into an LLM's context — and present a unified taxonomy from core components (retrieval/generation, processing, management) to full

walks you through building prompt-driven tools, retrieval-augmented generation (RAG) systems" — Article explicitly covers RAG as a core hands-on topic for developers building LLM applications

AI Design Patterns: Understanding RAG Pattern" — Article focuses on RAG pattern as a design pattern, demonstrating the concept

RAG Agent: Create a Retrieval Augmented Generation (RAG) agent. LangGraph: Build a RAG agent using LangGraph primitives for fine-grained control." — Documentation explicitly provides RAG agent tutoria

adds a decision loop that checks and refines retrieved information before answering" — Agentic RAG explicitly extends standard RAG by introducing decision loops for refinement and validation.

[high] "Agents wrote 100% of everything you see incl. Nix flakes, CI jobs, etc." — Direct demonstration of agents autonomously generating production code including infrastructure and CI configuration

RAG --> Selectively inject relevant information into context" — Article directly discusses RAG as a core technique for managing context in LLM applications, with explicit focus on selective informatio

Hybrid search = precision of lexical search + intuition of semantic search" — Article articulates a novel hybrid approach combining lexical precision with semantic intuition, extending traditional ret

能力评估:回答"它能做什么?",初始通过率较低,用于推动进步 vs 回归评估:回答"它还能用吗?",应保持接近 100% 通过率" — Article provides framework distinguishing capability evals (exploratory) from regression evals (protective), core testing strategy

Legal RAG systems typically take 3-6 months to build. We did it in 36 hours." — Article demonstrates a practical implementation of RAG for legal documents, showing rapid development and operational su

"Don't waste your time on RAG, subagents, agents 2.0 or other things that are mostly just charade. Just talk to it."" — Peter explicitly dismisses RAG as a waste of time and part of AI "charade," argu

Most teams build RAG systems that only see text." — Article directly addresses RAG system architecture and proposes multimodal hybrid search as an extension to traditional text-only RAG approaches, de

Top RAG systems today combine BM25 with vector search. You get the best of both worlds: semantic understanding AND precise keyword matching." — Article explicitly discusses how modern RAG systems inte

RAG grounds agents in real-world data, allowing them to safely interact with internal documents, retrieve key information, and cite actual regulations and policies." — Article explicitly argues that R

Let's look into the process of tracing from a perspective of a naive RAG system." — Article demonstrates tracing implementation specifically using a RAG system as the concrete example, showing how spa

It also includes the user's query, retrieved documents, conversation history, and other relevant data." — Article directly addresses how RAG systems work in production and discusses the retrieved docu

Takes one day. Pushes accuracy to 85-90% every time" — Article provides empirical evidence that combined strategy (hybrid search + chunk optimization + reranking) consistently achieves 85-90% accuracy

Implement practical RAG pipelines including chunking, embedding, retrieval, hybrid search, and contextual filtering" — Book provides comprehensive RAG implementation patterns including all major compo

solve the LLM context problem with RAG" — Article explicitly identifies RAG as a primary strategy for addressing LLM context limitations

grounded citations. The UI shows you which chunks were used, so you can trace responses back to source material" — RAGFlow implements core RAG pattern with citation grounding and traceability, showing

Most people think of notes as information. You write something down, you file it somewhere, you search for it later. The note is a record. The system is a library. Your job is retrieval... This framin

Resources : Context and data, for the user or the AI model to use" — MCP's Resources feature provides context and data to AI models, which is the core mechanism enabling RAG patterns to function effec

acts as gateway to semantically search hf docs... add to which uses latest open weights LLMs" — Article describes complete RAG pipeline: semantic search over documents feeding into LLM generation

The final spec is a short 2-min read that includes: Context, Principles (non-negotiable constraints), Requirements..." — Ramp uses Claude Code to generate structured product specifications with consis

Designed and implemented LLM-based AI agents for internal workflow automation, integrating RAG, tool invocation, and backend APIs, increasing internal team productivity by 40%." — Concrete production

Why retrieval fails for complex reasoning tasks. How knowledge graphs connect information that vector search misses." — Article extends RAG understanding by discussing failure modes and alternatives l

A GitHub with repos from this month using Claude Code, multi-agent orchestration, or RAG evaluation frameworks tells them you're current." — Article explicitly names RAG evaluation frameworks as a mar

LANDAU, a layered knowledge base preserving retrieved papers, curated prior knowledge, and validated methodology traces" — LANDAU explicitly implements RAG by retrieving and preserving papers alongsid

LangChain provides comprehensive RAG tooling but operates primarily through single-agent execution patterns." — Article directly discusses LangChain's RAG capabilities and tooling as a key feature

两周后数据库有 500 条记录,用户问'我的工作情况',返回 12 段矛盾片段。Agent 幻觉出错误综合答案。关键认知:Embedding 衡量相似性,不是真实性。" — Article identifies a critical limitation in RAG: embeddings measure similarity not veracity, leading to contradic

The winning approach removed the decision entirely. An 8KB compressed index embedded in AGENTS[.]md, with one instruction: "Prefer retrieval-led reasoning over pre-training-led reasoning."" — Article

Build a working memory source, like a vector database that your agent queries on demand" — Demonstrates RAG pattern in production: storing context externally in a vector database and retrieving releva

In practice, an entire context engine can be built to automate these steps. A context engine is a software system that sits between the user and the LLM, orchestrating all the context assembly in real

Evolution of Context Engineering systems from 2020 to 2025, showing the progression from basic RAG implementations to sophisticated multi-agent systems" — Article traces RAG as foundational implementa

[DIRECT] "RAG architectures, popularized by the 2020 Lewis et al. paper, brought this concept into the LLM era. But RAG is just the retrieval mechanism. Context engineering is the discipline of decidi

retrieval pipelines that fetch relevant documents" — Article explicitly lists retrieval pipelines as a core component of context engineering, which feeds context design decisions.

Retrieval gives you fragments, but agents need the whole picture." — Article explicitly challenges RAG as insufficient for agents because fragmentation loses context across pages and function signatur

Answering questions such as "Explain retrieval-augmented generation" can be achieved with a given set of data sources and some reasoning to compare them" — Shows RAG as practical methodology for quest

RAG grounds them in your data. It's the most common pattern for building production AI applications, from customer support bots to internal knowledge assistants. Understanding chunking strategies, emb

query this concept
$ db.articles("retrieval-augmented-generation")
$ db.cooccurrence("retrieval-augmented-generation")
$ db.contradictions("retrieval-augmented-generation")