.

Technology

OCR

Optical Character Recognition (OCR) is the foundational technology that converts typed, printed, or handwritten text from images (scans, JPEGs, PDFs) into machine-readable, searchable data.

OCR is a critical data extraction tool: it transforms non-editable text in digital images into structured, actionable information. The process involves image analysis, character recognition (using pattern matching or feature extraction), and post-processing for accuracy. Modern systems, leveraging AI/ML (Intelligent Character Recognition or ICR), achieve high-accuracy rates, often exceeding 99% on clean documents. Key applications include automating data entry for high-volume documents (invoices, receipts, bank statements), digitizing historical archives for searchability (e.g., Google Books), and real-time functions like license plate recognition (LPR) in traffic systems. This technology cuts manual data entry time and enables powerful text-based analytics.

https://cloud.google.com/vision/docs/ocr
27 projects · 24 cities

Related technologies

Recent Talks & Demos

Showing 1-24 of 27

Members-Only

Sign in to see who built these projects

Nixus: Orchestrating Agentic Infrastructure
Hong Kong Apr 29
Nix NixOS
DroneQRF: Predictive Drone Security Platform
Johannesburg Mar 31
Node Express
Holocron: Skilled Trades Operating System
Eastside Entrepreneurs Mar 5
v0 Figma
Detect Issues, Fix with Agents
Prague Feb 26
Gemini 3 Temporal
Local OCR for Administrative Workflows
Tokyo Feb 19
Tesseract Multimodal AI
PaddleOCR Layout to SQL
Hong Kong Jan 20
PaddleOCR-VL ERNIE 5
CLU: Multi-Agent Orchestration
Miami Dec 18
CLU GRID Framework
Sentence Transformers: Content Categorization
Nürnberg Nov 20
GPT-4 LangChain
Claude Agents with SigAgent Tracing
Boston Nov 17
Claude Code Python
ZoningPal: Toronto Zoning By-Law AI
Toronto Nov 10
Python Node
Supervised Socratic AI Tutors
Dhaka Nov 1
Qdrant FastAPI
AI Card Scanner Prompt Engineering
Hong Kong Sep 29
AWS Lambda Amazon Textract
Cursor Personal Knowledge Management Tool
Nairobi Sep 25
Cursor Tampermonkey
Agent Builds Declarative AI Workflows
Paris Sep 18
Pipelex Python
Benchmarking LLMs for Fraud Detection
Minneapolis Saint Paul Sep 10
AWS Bedrock LangChain
Chaos to Concurrency with AI
Pereira Aug 28
Google Gemini Python
PaddlePaddle: Structuring Legal Docs
Hong Kong Aug 22
Python PaddleOCR
Sistema Inmunológico Adopción IA
Santiago Jul 31
ChatGPT Make
Aura: Local AI Gaming Companion
DC Jul 10
Llama 3 OpenAI Whisper
AI Decodes East Asian Archives
Hong Kong May 29
YOLO spaCy
Document Photo OCR
Paris May 15
GPT-4 LangChain
Video Understanding and Bib OCR
Montreal May 7
Llama 3 DeepSeek
Mistral OCR: Structured Data Extraction
Chicago Apr 15
Mistral OCR API React
AI Agent for DIAN Compliance
Medellín Apr 1
ChatGPT Node