.

Technology

Gemini

Google's natively multimodal AI model: understands and operates across text, code, audio, image, and video.

Gemini is Google's most capable and general AI model, engineered from the ground up to be natively multimodal: it seamlessly understands and combines information across text, code, audio, image, and video inputs. The technology is optimized for flexibility, running efficiently on everything from data centers to mobile devices. It is deployed in three key sizes: Ultra (for highly complex tasks), Pro (for broad scaling), and Nano (for efficient on-device tasks). Developers access this power via the Gemini API to build next-generation applications.

https://deepmind.google/technologies/gemini/
254 projects · 70 cities

Related technologies

Recent Talks & Demos

Showing 241-254 of 254

Members-Only

Sign in to see who built these projects