Technology

granite-embedding 278m

IBM's high-efficiency 278M parameter model optimized for multilingual text embeddings and high-performance RAG workflows.

The granite-embedding 278m model delivers a lean, high-performance solution for dense vector retrieval across 38 languages. Built on a transformer architecture with a 512-token context window, it balances a compact 278-million parameter footprint with state-of-the-art accuracy on the MTEB benchmark. IBM engineered this model specifically for Retrieval-Augmented Generation (RAG) tasks, ensuring low-latency inference and precise semantic search capabilities for enterprise AI pipelines.

1 project · 1 city

Related technologies

Recent Talks & Demos

Showing 1-1 of 1

Members-Only