Term
Retrieval
The lookup stage of a RAG pipeline: fetching the text chunks most relevant to a query from a corpus, typically by comparing a query embedding against chunk embeddings. Retrieval quality, rather than the LLM itself, is usually the bottleneck on RAG quality.
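As a rough illustration of the lookup step, the sketch below ranks chunks by cosine similarity to a query embedding and returns the top k. It assumes embeddings are already computed and held in memory as plain lists of floats. The names `cosine`, `retrieve`, and `toy_chunks`, and the three-dimensional vectors, are hypothetical and chosen only for this example. A real system would normally use an embedding model and a vector index instead.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def retrieve(query_vec, chunk_vecs, k=2):
    """Return the ids of the k chunks most similar to the query embedding."""
    scored = [(cosine(query_vec, vec), chunk_id)
              for chunk_id, vec in chunk_vecs.items()]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk_id for _, chunk_id in scored[:k]]


# Toy embeddings (assumed values, for illustration only).
toy_chunks = {
    "chunk_a": [1.0, 0.0, 0.0],
    "chunk_b": [0.9, 0.1, 0.0],
    "chunk_c": [0.0, 1.0, 0.0],
}

top = retrieve([1.0, 0.0, 0.0], toy_chunks, k=2)
print(top)
```

Scanning every chunk like this is exhaustive search, which is fine for a small corpus. Larger corpora usually rely on an approximate nearest-neighbour index.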
Background
Retrieval strategies range from dense vector search over embeddings, to sparse keyword search such as BM25, to hybrid combinations of the two. Many modern systems then re-rank the top candidates with a cross-encoder to improve precision. Recall@k and mean reciprocal rank (MRR) are the standard evaluation metrics. Chunk size, chunk overlap, and the choice of embedding model each shift the precision/recall trade-off meaningfully.
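The two metrics named above can be sketched in a few lines. This example assumes each query comes with a ranked list of retrieved ids and a set of ids judged relevant. The function names and the toy data in `runs` are hypothetical and not taken from any particular evaluation library.

```python
def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the relevant ids that appear in the top k results."""
    if not relevant_ids:
        return 0.0
    hits = len(set(ranked_ids[:k]) & set(relevant_ids))
    return hits / len(relevant_ids)


def mean_reciprocal_rank(runs):
    """Average of 1/rank of the first relevant result over all queries.

    A query with no relevant result in its ranking contributes 0.
    """
    if not runs:
        return 0.0
    total = 0.0
    for ranked_ids, relevant_ids in runs:
        relevant = set(relevant_ids)
        for rank, doc_id in enumerate(ranked_ids, start=1):
            if doc_id in relevant:
                total += 1.0 / rank
                break
    return total / len(runs)


# Toy evaluation data (assumed): (ranked results, relevant ids) per query.
runs = [
    (["d3", "d1", "d7"], {"d1"}),        # first relevant hit at rank 2
    (["d2", "d5", "d9"], {"d2", "d9"}),  # first relevant hit at rank 1
]

print(recall_at_k(runs[1][0], runs[1][1], k=2))  # 0.5
print(mean_reciprocal_rank(runs))                # 0.75
```

Recall@k rewards finding every relevant chunk within the top k, while MRR only looks at how high the first relevant chunk ranks, so the two can disagree when comparing chunking or embedding choices.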