.

Technology

Image

Automates data normalization by resizing images to 224x224 via Pillow and transcoding audio into uniform 16kHz mono formats.

This workflow automates the heavy lifting of data preparation for multimodal AI. We use Pillow to force images into a 224x224 pixel square (the standard for ResNet and VGG architectures) while maintaining aspect ratio through smart padding. On the audio side, we leverage FFmpeg to transcode diverse formats into 16kHz mono WAV files: this ensures consistent sample rates for downstream spectrogram generation. It is a no-nonsense approach to cleaning noise and unifying inputs before they hit the training loop.

https://pillow.readthedocs.io/
72 projects · 51 cities

Related technologies

Recent Talks & Demos

Showing 1-24 of 72

Members-Only

Sign in to see who built these projects

Crowd-Sourced AI Job Aggregator
Lausanne Apr 30
Python Playwright
Shop Talk
St Louis Apr 14
BLIP CLIP
Gemini Local Hub: Stateful Agents
Seattle Apr 13
Next React
Ágora: AI Marketing Campaign Simulator
São Paulo Mar 26
React 18 TypeScript
3D Gaussian Splatting Memory Reconstruction
Hong Kong Mar 26
3D Gaussian Splatting Meta SAM 3D
Loupe: Claude Code GTM Pipelines
Seattle Mar 25
Claude Code Supabase
UofT: Intelligent Document Search
Toronto Mar 25
Python FastAPI
AI WhatsApp Fallas Guide
Valencia Mar 17
OpenAI API RAG
Words to World: AI Models
San Diego Feb 26
Unreal Engine 5 PyTorch
MédicalHub: Pneumothorax Detection AI
Conakry Feb 25
DenseNet ImageNet
Local OCR for Administrative Workflows
Tokyo Feb 19
Tesseract Multimodal AI
Deadlift Back Curvature Tracking
Raleigh Feb 11
MediaPipe Pose tracking
IgnitionAI: AI Production Workflow
St Louis Feb 4
Vertex AI (Gemini Imagen)
Infinite Wiki: Cooperative World Building
Cologne Jan 21
RAG Knowledge Graphs
Archingeo: AI Safety in Infrastructure
Orange County Jan 14
NVIDIA Jetson PyTorch
Minds & Models: Real-time Santa AI
Prague Dec 16
Python OpenCV
DARIA: Multi-modal Assessment Pipeline
Raleigh Dec 10
React Python
VeloBlanco: Automating Media Verification
Bogotá Nov 27
GPT-4 Claude
DungeonMind: D&D AI Tools
Denver Nov 24
GPT-5 Image generation
Youz: AI Landmark Guide
Nürnberg Nov 20
GPT-4 Claude-3
Secure AI Health Assistant with EHR
Dhaka Nov 1
OpenAI API FastAPI
Citrus-Inventario: AI Inventory PDFs
Pereira Oct 30
FastAPI WhatsApp
AI Tax Calculator for Canada
Toronto Oct 30
Manus AI GPT-4o
Worldlabs: Single Image Worldbuilding
Toronto Oct 30
Gaussian Splatting PlayCanvas