Technology

granite-embedding 278m for RAG

A high-efficiency 278-million-parameter embedding model optimized for enterprise RAG pipelines and multilingual document retrieval.

IBM's Granite 278M embedding model balances retrieval quality and speed for Retrieval-Augmented Generation (RAG). It accepts inputs of up to 512 tokens and supports 12 languages (including English, German, and Japanese). It maps text to 768-dimensional vectors that can be compared with cosine similarity; paired with a vector index, this allows fast similarity search across large document stores, with actual latency depending on the index and hardware. The model targets business use cases where low latency and high retrieval accuracy are required for grounding LLM responses.
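
To make the retrieval step concrete, here is a minimal sketch of cosine-similarity search over 768-dimensional vectors. It is not the model's own API: the helper names (`cosine_similarity`, `top_k`, `toy_vector`) are hypothetical, and one-hot toy vectors stand in for real embeddings so the example runs without downloading anything. In a real pipeline the vectors would come from the model itself, and a vector index would replace the linear scan.

```python
import math

EMBEDDING_DIM = 768  # vector size stated in the description above


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as matching nothing
    return dot / (norm_a * norm_b)


def top_k(query_vec, doc_vecs, k=3):
    """Return (index, score) pairs for the k most similar documents."""
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


def toy_vector(hot_index, dim=EMBEDDING_DIM):
    """Hypothetical stand-in for a model embedding: a one-hot vector."""
    vec = [0.0] * dim
    vec[hot_index] = 1.0
    return vec


# Three toy "documents" and a query that mostly matches the second one.
docs = [toy_vector(0), toy_vector(1), toy_vector(2)]
query = toy_vector(1)
query[2] = 0.5  # partial overlap with the third document

results = top_k(query, docs, k=2)
for index, score in results:
    print(f"doc {index}: score {score:.3f}")
```

The linear scan is only meant to show the ranking logic; its cost grows with the number of documents, which is why large document stores typically rely on an approximate nearest-neighbor index instead.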

https://huggingface.co/ibm-granite/granite-embedding-278m-multilingual

1 project · 1 city

Related technologies

AI agents (35) · dice rolls (1) · Docker (128) · granite-embedding 278m (1) · Jan-nano-gguf (1) · monsters (1) · Qwen-2 (3) · RAG (138) · Scratch (2)

Recent Talks & Demos

Members-Only

Compose and Dragons: Tiny LLMs

Paris Apr 21

Docker · Jan-nano-gguf