Next Generation Media & Device
Open-Sourced Text-to-Video Model: CogVideoX
Speakers
Presentation Slides
Presentation Video
We introduce CogVideoX, a large-scale diffusion transformer model designed for generating videos based on text prompts. Results show that CogVideoX demonstrates state-of-the-art performance across both multiple machine metrics and human evaluations. The model weight of CogVideoX is publicly available at https://github.com/THUDM/CogVideo.