Technology
Image
Automates data normalization by resizing images to 224x224 via Pillow and transcoding audio to 16 kHz mono WAV via FFmpeg.
This workflow automates data preparation for multimodal AI. We use Pillow to fit images into a 224x224 pixel square (the standard input size for ImageNet-trained ResNet and VGG models), preserving aspect ratio by scaling the longer side to 224 and padding the remainder. On the audio side, we use FFmpeg to transcode diverse formats into 16 kHz mono WAV files, which gives downstream spectrogram generation a consistent sample rate and channel count. It is a no-nonsense approach to cleaning up inconsistent inputs and unifying them before they hit the training loop.
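The two steps described above might look roughly like the following. This is a minimal sketch, not the workflow's actual code: the helper names (`fit_box`, `letterbox`, `ffmpeg_args`), the black padding color, the Lanczos filter, and the 16-bit PCM codec are all assumptions chosen for illustration.

```python
from typing import List, Tuple

TARGET = 224          # square edge length from the description above
SAMPLE_RATE = 16_000  # 16 kHz target sample rate


def fit_box(width: int, height: int, target: int = TARGET) -> Tuple[int, int, int, int]:
    """Return (new_w, new_h, x_off, y_off) for centering an image in a square.

    The longer side is scaled to `target`; the shorter side keeps the
    aspect ratio and is centered, leaving room for padding.
    """
    scale = target / max(width, height)
    new_w = max(1, round(width * scale))
    new_h = max(1, round(height * scale))
    return new_w, new_h, (target - new_w) // 2, (target - new_h) // 2


def letterbox(path: str, target: int = TARGET, fill=(0, 0, 0)):
    """Open an image and return a padded target x target RGB copy."""
    # Imported lazily so the helpers above and below work without Pillow.
    from PIL import Image

    with Image.open(path) as img:
        rgb = img.convert("RGB")
        new_w, new_h, x_off, y_off = fit_box(rgb.width, rgb.height, target)
        resized = rgb.resize((new_w, new_h), Image.LANCZOS)

    # Assumption: pad with a solid color; the original only says "padding".
    canvas = Image.new("RGB", (target, target), fill)
    canvas.paste(resized, (x_off, y_off))
    return canvas


def ffmpeg_args(src: str, dst: str) -> List[str]:
    """Build an FFmpeg command that writes 16 kHz mono 16-bit PCM WAV."""
    return [
        "ffmpeg", "-y",          # overwrite the output if it exists
        "-i", src,
        "-ac", "1",              # one audio channel (mono)
        "-ar", str(SAMPLE_RATE), # resample to 16 kHz
        "-c:a", "pcm_s16le",     # assumption: 16-bit little-endian PCM
        dst,
    ]
```

A caller could run the audio step with `subprocess.run(ffmpeg_args("clip.mp3", "clip.wav"), check=True)` and save the image step with `letterbox("photo.jpg").save("photo_224.png")`; both file names here are placeholders.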
72 projects · 49 cities
Recent Talks & Demos
Showing 21-44 of 72
Secure AI Health Assistant with EHR · Dhaka · Nov 1 · OpenAI API, FastAPI
Citrus-Inventario: AI Inventory PDFs · Pereira · Oct 30 · FastAPI, WhatsApp
AI Tax Calculator for Canada · Toronto · Oct 30 · Manus AI, GPT-4o
Worldlabs: Single Image Worldbuilding · Toronto · Oct 30 · Gaussian Splatting, PlayCanvas
Teaching AI Maya Glyphs · Montreal · Oct 21 · YOLOv8, ResNet
Google Cloud: High-Performance AI · Austin · Oct 9 · Gemini, Veo3
Story Machine: 5-Minute AI Music Video · London · Oct 7 · Claude, Mistral
Read.ai, OpenAI, LinkedIn Marketing Automation · Seattle · Sep 30 · OpenAI API, DALL-E
FiftyOne Visual Similarity Search · Raleigh · Sep 30 · FiftyOne, CLIP
Uptalk: AI Language Chat · Boston · Sep 29 · iOS, OpenAI API
Vibe Coding: IT Manager to AI · Hong Kong · Sep 29 · Next, Supabase
AI Card Scanner Prompt Engineering · Hong Kong · Sep 29 · AWS Lambda, Amazon Textract
Dilbert: AI Matches News Comics · Hong Kong · Sep 29 · Qwen, image-to-text
AI-B2in: AI Animation In-Betweens · Manizales · Sep 24 · React, TypeScript
AI Multi-Hazard Alert System · Oslo · Sep 19 · Python, scikit-learn
Agent Builds Declarative AI Workflows · Paris · Sep 18 · Pipelex, Python
Anamorpher: Downscaling Prompt Injection · New York City · Sep 17 · GPT-4, LangChain
Benchmarking LLMs for Fraud Detection · Minneapolis Saint Paul · Sep 10 · AWS Bedrock, LangChain
Chaos to Concurrency with AI · Pereira · Aug 28 · Google Gemini, Python
Homemade and Follow AI Platforms · Amsterdam · Aug 27 · TAO, LLM
AI Road Infrastructure Management · New York City · Aug 26 · OpenAI API, PyTorch
CrewAI: Automated Book Writing · Boston · Aug 25 · GPT-4o, OpenAI API
Zod: Structured State, Fixed Context · Boston · Jul 28 · Gemini, Imagen 4
VidyaNav-ai: AI Classroom Assistant · Munich · Jul 25 · FastAPI, Vertex AI