RAG

Creator: Seonglae Cho
Created: 2023 Oct 25 11:31
Edited: 2025 Aug 4 21:13

Retrieval-Augmented Generation

RAG is not memory. Search or memory is better than naive RAG. Better retrieval performance ≠ better end-to-end performance.
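As a point of reference for what "naive RAG" means here, the following is a minimal sketch, not any particular library's implementation. It assumes a toy bag-of-words `embed` function standing in for a learned embedding model, plain dot-product scoring, and a hypothetical `build_prompt` helper that only assembles the text an LLM would receive.

```python
from collections import Counter

def embed(text):
    """Toy bag-of-words vector (assumption: stands in for a learned embedding model)."""
    return Counter(text.lower().split())

def dot(a, b):
    """Dot product of two sparse term-count vectors."""
    return sum(a[t] * b[t] for t in a if t in b)

def retrieve(query, docs, k=2):
    """Return the k documents with the highest dot-product score against the query."""
    q = embed(query)
    return sorted(docs, key=lambda d: dot(q, embed(d)), reverse=True)[:k]

def build_prompt(query, docs, k=2):
    """Prepend retrieved passages to the question (hypothetical helper; no LLM is called)."""
    context = "\n".join(retrieve(query, docs, k))
    return f"Context:\n{context}\n\nQuestion: {query}"

docs = [
    "retrieval latency grows with index size",
    "knowledge graphs store entities and relations",
    "long context models read whole documents",
]
prompt = build_prompt("what affects retrieval latency", docs, k=1)
```

The retriever only ranks passages by score. Whether the final answer improves depends on what the generator does with the retrieved context, which is one reading of why better retrieval does not guarantee better overall performance.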

Challenges

  • retrieval latency
  • system complexity
RAG Notion
https://arxiv.org/pdf/2309.01431.pdf
RAG Usages
https://arxiv.org/pdf/2312.10997v1.pdf

Beyond dot product

https://www.youtube.com/watch?v=mE7IDf2SmJg
Stanford CS25: V3 I Retrieval Augmented Language Models
December 5, 2023. Douwe Kiela, Contextual AI. Language models have led to amazing progress, but they also have important shortcomings. One solution for many of these shortcomings is retrieval augmentation. I will introduce the topic, survey recent literature on retrieval augmented language models and finish with some of the main open questions. More about the course can be found here: https://web.stanford.edu/class/cs25/ View the entire CS25 Transformers United playlist: https://www.youtube.com/playlist?list=PLoROMvodv4rNiJRchCzutFw5ItR_Z27CM
Intro of Retrieval Augmented Generation (RAG) and application demos
Introduction of Retrieval Augmented Generation, Jupyter Notebook three demos of Basic RAG, Sentence-window retrieval, Auto-merging…
NVIDIA Research: RAG with Long Context LLMs
This blog post dives into NVIDIA’s recent study comparing retrieval-augmentation with and without long-context LLMs.

KBQA, Knowledge Graph

ML Blog - Improve ChatGPT with Knowledge Graphs
Leveraging knowledge graphs for LLMs using LangChain
llama-recipes/recipes/use_cases/agents/langchain at main · meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization an...
Base models are better at retrieval than Instruction Tuning models
arxiv.org
