vibedonaldsvibedonalds.com
Term

Retrieval

The lookup stage of a RAG pipeline — fetching relevant text chunks from a corpus, given a query embedding. Quality of retrieval is usually the bottleneck on RAG quality, not the LLM itself.

Background

Retrieval strategies range from dense vector search to BM25 keyword search to hybrid combinations. Modern systems re-rank with a cross-encoder for precision. Recall-at-k and MRR are the standard metrics. Chunk size, overlap, and embedding model choice each shift the precision/recall curve meaningfully.