RAG-Token
RAG-Token is a sequence-to-sequence generation model that retrieves relevant documents and predicts each target token by marginalizing over a distinct latent document distribution per token, so different documents can inform different tokens.
RAG-Token (Retrieval-Augmented Generation) improves language generation by conditioning on retrieved documents at the level of individual tokens rather than the entire output sequence. Developed by researchers at Facebook AI Research (FAIR), University College London, and NYU, the architecture lets the model draw on a different knowledge source for each word it generates. By marginalizing over a set of top-k retrieved documents (typically k=5 or k=10), RAG-Token outperforms standard parametric sequence-to-sequence models on knowledge-intensive tasks such as Natural Questions and Jeopardy question generation, while producing more factual, less hallucinated text.
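The per-token marginalization described above can be sketched numerically: at each generation step the output distribution is p(y_t | x, y_<t) = Σ_z p(z | x) · p(y_t | x, z, y_<t), summed over the top-k retrieved documents z. The following is a minimal NumPy illustration under stated assumptions, not the FAIR implementation; the function name `marginalize_token` and the toy scores are hypothetical.

```python
import numpy as np

def marginalize_token(doc_scores, token_logits):
    """RAG-Token-style marginalization for one generation step.

    doc_scores: shape (k,), retrieval scores for the top-k documents.
    token_logits: shape (k, vocab), generator logits conditioned on
                  each document separately.
    Returns the marginal token distribution, shape (vocab,).
    """
    # p(z | x): softmax over document retrieval scores
    p_doc = np.exp(doc_scores - doc_scores.max())
    p_doc /= p_doc.sum()

    # p(y_t | x, z, y_<t): softmax over the vocabulary, per document
    logits = token_logits - token_logits.max(axis=1, keepdims=True)
    p_tok = np.exp(logits)
    p_tok /= p_tok.sum(axis=1, keepdims=True)

    # Marginalize: weight each document's token distribution by p(z | x)
    return p_doc @ p_tok

# Toy example: k=3 documents, 4-word vocabulary
doc_scores = np.array([2.0, 1.0, 0.5])
token_logits = np.array([
    [3.0, 0.1, 0.1, 0.1],  # document 1 strongly favors word 0
    [0.1, 3.0, 0.1, 0.1],  # document 2 strongly favors word 1
    [0.1, 0.1, 3.0, 0.1],  # document 3 strongly favors word 2
])
p = marginalize_token(doc_scores, token_logits)
print(p.argmax())  # word 0 wins, since document 1 has the highest retrieval score
```

Because the mixture is recomputed at every step, the model can rely on one document for one token and a different document for the next, which is what distinguishes RAG-Token from the RAG-Sequence variant that fixes a single document distribution for the whole output.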