Technology
Groq
Groq delivers ultra-fast AI inference using its custom-built Language Processing Unit (LPU) to accelerate Large Language Models (LLMs) at scale.
Groq specializes in high-speed AI inference, leveraging its proprietary Language Processing Unit (LPU) Inference Engine: a chip specifically architected for generative AI and LLMs. The LPU's unique dataflow architecture bypasses the memory and compute bottlenecks of traditional GPUs, delivering consistent, ultra-low-latency performance and superior energy efficiency. This technology, accessible via the GroqCloud platform or on-premise GroqRack clusters, enables real-time application deployment for demanding enterprise customers. Founded in 2016 by former Google engineers (including a lead designer of the TPU), Groq is setting the new standard for real-time AI compute.
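As an illustration of the GroqCloud access path mentioned above, the sketch below shows a minimal chat-completion request using Groq's Python SDK. The model id and prompt are placeholders, and the API key is assumed to be available in the GROQ_API_KEY environment variable; check GroqCloud's documentation for currently hosted models.

# Minimal sketch: querying an LLM served on GroqCloud via the groq Python SDK.
# Assumes `pip install groq` and a GROQ_API_KEY environment variable are in place.
from groq import Groq

client = Groq()  # picks up GROQ_API_KEY from the environment

response = client.chat.completions.create(
    model="llama-3.1-8b-instant",  # placeholder model id
    messages=[
        {"role": "user", "content": "Summarize what an LPU is in one sentence."}
    ],
)

print(response.choices[0].message.content)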