Technology
Automates data normalization by resizing images to 224x224 via Pillow and transcoding audio into uniform 16kHz mono formats.
This workflow automates the heavy lifting of data preparation for multimodal AI. We use Pillow to fit images into a 224x224 pixel square (the standard input size for ResNet and VGG architectures), preserving aspect ratio by scaling the image to fit and padding the remaining area. On the audio side, we use FFmpeg to transcode diverse formats into 16kHz mono WAV files, which ensures a consistent sample rate for downstream spectrogram generation. It is a no-nonsense approach to cleaning noise and unifying inputs before they hit the training loop.
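A minimal sketch of the two normalization steps described above, not the implementation used by any listed project. The helper names (`fit_size`, `letterbox`, `ffmpeg_args`, `transcode`), the black padding color, the LANCZOS filter, and the 16-bit PCM codec are all assumptions chosen for illustration. The audio step assumes an `ffmpeg` binary is available on the PATH.

```python
import subprocess

from PIL import Image

TARGET = 224  # square input size named in the description above


def fit_size(width, height, target=TARGET):
    """Return (w, h) scaled so the longer side equals target, keeping aspect ratio."""
    scale = target / max(width, height)
    return max(1, round(width * scale)), max(1, round(height * scale))


def letterbox(img, target=TARGET, fill=(0, 0, 0)):
    """Resize img to fit a target x target square and pad the rest with fill.

    The padding color and resampling filter are assumptions, not taken
    from the source text.
    """
    new_w, new_h = fit_size(img.size[0], img.size[1], target)
    resized = img.convert("RGB").resize((new_w, new_h), Image.LANCZOS)
    canvas = Image.new("RGB", (target, target), fill)
    # Center the resized image on the square canvas.
    canvas.paste(resized, ((target - new_w) // 2, (target - new_h) // 2))
    return canvas


def ffmpeg_args(src, dst):
    """Build an FFmpeg command that writes 16 kHz mono 16-bit PCM WAV."""
    return [
        "ffmpeg", "-y",        # overwrite dst if it already exists
        "-i", src,
        "-ac", "1",            # one audio channel (mono)
        "-ar", "16000",        # 16 kHz sample rate
        "-c:a", "pcm_s16le",   # assumed codec: signed 16-bit little-endian PCM
        dst,
    ]


def transcode(src, dst):
    """Run FFmpeg and raise CalledProcessError if it exits with an error."""
    subprocess.run(ffmpeg_args(src, dst), check=True)
```

For example, `letterbox(Image.open("photo.jpg")).save("photo_224.png")` and `transcode("clip.mp3", "clip_16k.wav")` would normalize one image and one audio file; the file names here are placeholders.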
72 projects · 49 cities
Recent Talks & Demos
Crowd-Sourced AI Job Aggregator
Lausanne
Apr 30
Python
Playwright
Shop Talk
St Louis
Apr 14
BLIP
CLIP
Gemini Local Hub: Stateful Agents
Seattle
Apr 13
Next
React
Ágora: AI Marketing Campaign Simulator
São Paulo
Mar 26
React 18
TypeScript
3D Gaussian Splatting Memory Reconstruction
Hong Kong
Mar 26
3D Gaussian Splatting
Meta SAM 3D
Loupe: Claude Code GTM Pipelines
Seattle
Mar 25
Claude Code
Supabase
UofT: Intelligent Document Search
Toronto
Mar 25
Python
FastAPI
AI WhatsApp Fallas Guide
Valencia
Mar 17
OpenAI API
RAG
Words to World: AI Models
San Diego
Feb 26
Unreal Engine 5
PyTorch
MédicalHub: Pneumothorax Detection AI
Conakry
Feb 25
DenseNet
ImageNet
Local OCR for Administrative Workflows
Tokyo
Feb 19
Tesseract
Multimodal AI
Deadlift Back Curvature Tracking
Raleigh
Feb 11
MediaPipe
Pose tracking
IgnitionAI: AI Production Workflow
St Louis
Feb 4
Vertex AI (Gemini, Imagen)
Infinite Wiki: Cooperative World Building
Cologne
Jan 21
RAG
Knowledge Graphs
Archingeo: AI Safety in Infrastructure
Orange County
Jan 14
NVIDIA Jetson
PyTorch
Minds & Models: Real-time Santa AI
Prague
Dec 16
Python
OpenCV
DARIA: Multi-modal Assessment Pipeline
Raleigh
Dec 10
React
Python
VeloBlanco: Automating Media Verification
Bogotá
Nov 27
GPT-4
Claude
DungeonMind: D&D AI Tools
Denver
Nov 24
GPT-5
Image generation
Youz: AI Landmark Guide
Nürnberg
Nov 20
GPT-4
Claude-3
Secure AI Health Assistant with EHR
Dhaka
Nov 1
OpenAI API
FastAPI
Citrus-Inventario: AI Inventory PDFs
Pereira
Oct 30
FastAPI
WhatsApp
AI Tax Calculator for Canada
Toronto
Oct 30
Manus AI
GPT-4o
Worldlabs: Single Image Worldbuilding
Toronto
Oct 30
Gaussian Splatting
PlayCanvas