.

Technology

Image

Automates data normalization by resizing images to 224x224 via Pillow and transcoding audio into uniform 16kHz mono formats.

This workflow automates the heavy lifting of data preparation for multimodal AI. We use Pillow to force images into a 224x224 pixel square (the standard for ResNet and VGG architectures) while maintaining aspect ratio through smart padding. On the audio side, we leverage FFmpeg to transcode diverse formats into 16kHz mono WAV files: this ensures consistent sample rates for downstream spectrogram generation. It is a no-nonsense approach to cleaning noise and unifying inputs before they hit the training loop.

https://pillow.readthedocs.io/
72 projects · 49 cities

Related technologies

Recent Talks & Demos

Showing 21-44 of 72

Members-Only

Sign in to see who built these projects

Secure AI Health Assistant with EHR
Dhaka Nov 1
OpenAI API FastAPI
Citrus-Inventario: AI Inventory PDFs
Pereira Oct 30
FastAPI WhatsApp
AI Tax Calculator for Canada
Toronto Oct 30
Manus AI GPT-4o
Worldlabs: Single Image Worldbuilding
Toronto Oct 30
Gaussian Splatting PlayCanvas
Teaching AI Maya Glyphs
Montreal Oct 21
YOLOv8 ResNet
Google Cloud: High-Performance AI
Austin Oct 9
Gemini Veo3
Story Machine: 5-Minute AI Music Video
London Oct 7
Claude Mistral
Read.ai, OpenAI, LinkedIn Marketing Automation
Seattle Sep 30
OpenAI API DALL-E
FiftyOne Visual Similarity Search
Raleigh Sep 30
FiftyOne CLIP
Uptalk: AI Language Chat
Boston Sep 29
iOS OpenAI API
Vibe Coding: IT Manager to AI
Hong Kong Sep 29
Next Supabase
AI Card Scanner Prompt Engineering
Hong Kong Sep 29
AWS Lambda Amazon Textract
Dilbert: AI Matches News Comics
Hong Kong Sep 29
Qwen image-to-text
AI-B2in: AI Animation In-Betweens
Manizales Sep 24
React TypeScript
AI Multi-Hazard Alert System
Oslo Sep 19
Python scikit-learn
Agent Builds Declarative AI Workflows
Paris Sep 18
Pipelex Python
Anamorpher: Downscaling Prompt Injection
New York City Sep 17
GPT-4 LangChain
Benchmarking LLMs for Fraud Detection
Minneapolis Saint Paul Sep 10
AWS Bedrock LangChain
Chaos to Concurrency with AI
Pereira Aug 28
Google Gemini Python
Homemade and Follow AI Platforms
Amsterdam Aug 27
TAO LLM
AI Road Infrastructure Management
New York City Aug 26
OpenAI API PyTorch
CrewAI: Automated Book Writing
Boston Aug 25
GPT-4o OpenAI API
Zod: Structured State, Fixed Context
Boston Jul 28
Gemini Imagen 4
VidyaNav-ai: AI Classroom Assistant
Munich Jul 25
FastAPI Vertex AI