Technology
Make-A-Video
Meta AI's generative system transforms text prompts into high-definition video clips using a diffusion model trained on millions of video-text and image-text pairs.
Meta AI researchers designed Make-A-Video to generate five-second video clips from simple text descriptions. The architecture extends a text-to-image diffusion model with spatiotemporal convolution and attention layers, trained on the WebVid-10M dataset (10 million video-text pairs) and a subset of the HD-VILA-100M dataset. By decoupling motion from appearance, the system learns how objects move from unlabeled video data while drawing visual fidelity from image-text pairs. Output videos run at 16 frames per second at 768x768 resolution. Examples range from a stylized dog wearing a superhero cape to realistic waves crashing on a beach. This technology reduces the need for manual video production for short-form creative assets.
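The "decoupling motion from appearance" idea can be illustrated by factorizing a 3D convolution over a video into a 2D spatial convolution applied per frame, followed by a 1D convolution across time. The sketch below is a minimal NumPy illustration of that factorization; the kernel sizes, shapes, and function names are illustrative assumptions, not Meta's actual implementation.

```python
import numpy as np

def spatial_conv(frames, k2d):
    """Apply one 2D kernel to every frame independently (valid convolution).

    frames: array of shape (T, H, W); k2d: (kh, kw) kernel.
    """
    T, H, W = frames.shape
    kh, kw = k2d.shape
    out = np.zeros((T, H - kh + 1, W - kw + 1))
    for t in range(T):
        for i in range(H - kh + 1):
            for j in range(W - kw + 1):
                out[t, i, j] = np.sum(frames[t, i:i + kh, j:j + kw] * k2d)
    return out

def temporal_conv(frames, k1d):
    """Apply a 1D kernel across the time axis at each pixel (valid convolution)."""
    T, H, W = frames.shape
    kt = k1d.shape[0]
    out = np.zeros((T - kt + 1, H, W))
    for t in range(T - kt + 1):
        # Weighted sum of kt consecutive frames.
        out[t] = np.tensordot(k1d, frames[t:t + kt], axes=(0, 0))
    return out

def pseudo3d_conv(video, k2d, k1d):
    """Factorized 'pseudo-3D' layer: spatial conv per frame, then temporal conv."""
    return temporal_conv(spatial_conv(video, k2d), k1d)

# Hypothetical usage: an 8-frame, 16x16 clip with a 3x3 spatial blur
# and a 3-tap temporal smoothing kernel.
video = np.random.default_rng(0).standard_normal((8, 16, 16))
out = pseudo3d_conv(video, np.ones((3, 3)) / 9.0, np.array([0.25, 0.5, 0.25]))
print(out.shape)  # (6, 14, 14)
```

Because the spatial kernel can be initialized from a pretrained image model while the temporal kernel is learned from video alone, this factorization is what lets appearance come from image-text data and motion from video-only data.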
Related technologies
Recent Talks & Demos