Technology
Image
Automates data normalization by resizing images to 224x224 via Pillow and transcoding audio into uniform 16kHz mono formats.
This workflow automates the heavy lifting of data preparation for multimodal AI. We use Pillow to force images into a 224x224 pixel square (the standard for ResNet and VGG architectures) while maintaining aspect ratio through smart padding. On the audio side, we leverage FFmpeg to transcode diverse formats into 16kHz mono WAV files: this ensures consistent sample rates for downstream spectrogram generation. It is a no-nonsense approach to cleaning noise and unifying inputs before they hit the training loop.
72 projects
·
49 cities
Related technologies
Recent Talks & Demos
Showing 41-64 of 72
AI Road Infrastructure Management
New York City
Aug 26
OpenAI API
PyTorch
CrewAI: Automated Book Writing
Boston
Aug 25
GPT-4o
OpenAI API
Zod: Structured State, Fixed Context
Boston
Jul 28
Gemini
Imagen 4
VidyaNav-ai: AI Classroom Assistant
Munich
Jul 25
FastAPI
Vertex AI
Vertex AI Agents: Imagen Veo
Seattle
Jul 24
Google Gemini
Imagen)
Ecocraft Designer: NBC Plans
Tiruchirappalli
Jul 12
Stable Diffusion XL-base-1
DALL·E 3
Mining opportunities
Santiago
Jun 26
GPT-4
Claude-3
AI Story Generator: 4 LLMs
Dublin
Jun 26
LangChain
LangGraph
Gemini Vertex AI Stress Detection
New York City
Jun 25
Vertex AI
BigQuery ML
AI Icon Generation Challenges
Nashville
Jun 23
OpenRouter
Gemini API
Print Your Prompt
Toronto
Jun 18
Next
OpenAI API
Arthur: LLM Pipeline for Novels
St Louis
Jun 5
OpenAI API
Google Firestore
DSPy: Self-Programming Meta-Agents
New York City
Jun 3
DSPY
vLLM
Artecon: Local CPU AI Hotspot
Seattle
May 30
llama
ONNX
Content Automation from Meetings
Seattle
May 30
OpenAI API
ChatGPT
Spacedust 3D: Fast Character Animation
Seattle
May 30
React
Node
Flujo para Video IA Largo
Medellín
May 29
Google Veo 3
OpenAI gpt-image-1
AI Decodes East Asian Archives
Hong Kong
May 29
YOLO
spaCy
Flujo Generación Video Largo
Manizales
May 28
Google Veo 3
OpenAI gpt-image-1
GPT-4o Circuit Board Art
Toronto
May 22
GPT-4o
KiCad
MCP vs Message Bus AI Agents
Las Vegas
May 8
Redis Streams
MCP
PromptPilot
Rio De Janeiro
Apr 26
GPT-4o
OpenAI API
AI Images for Social Media
Mumbai
Apr 26
Ideogram
Bing Image Creator
Vinchy: AI Fit Matching
Seattle
Apr 24
Flutter
Cloud Run