.

Technology

IDM-VTON

IDM-VTON (Improving Diffusion Models for Authentic Virtual Try-on in the Wild): a high-fidelity diffusion model that generates realistic, detail-preserving virtual try-on images from a person and garment photo.

IDM-VTON is a cutting-edge diffusion model for image-based virtual try-on, designed to render a person wearing a new garment with unprecedented realism and detail preservation. It leverages a novel architecture built on Stable Diffusion XL, featuring two specialized UNets (TryonNet and GarmentNet) and an IP-Adapter for robust garment encoding. Specifically, it fuses high-level garment semantics via cross-attention and low-level features via self-attention, ensuring fine details like patterns and logos are accurately transferred. This approach significantly outperforms previous GAN-based and diffusion-based methods in quantitative metrics and real-world authenticity, making it a powerful tool for e-commerce and digital fashion applications.

https://github.com/yisol/IDM-VTON
1 project · 2 cities

Related technologies

Recent Talks & Demos

Showing 1-1 of 1

Members-Only

Sign in to see who built these projects