VASC Seminar - Xun Huang

November 3, 2025 3:30pm — 4:30pm

Location:
In Person - Newell-Simon 3305

Speaker:
XUN HUANG , Founder & CEOStealth Startup
http://xunhuang.me/

From Video Generation to Video World Models

Video diffusion models have achieved remarkable success in content creation, yet they still fall short of simulating interactive worlds that respond to users in real time. This talk examines the fundamental challenges preventing these models from evolving into true world simulators. I will present a series of works — CausVid, Self-Forcing, MotionStream, and State-Space World Model — that collectively mark a paradigm shift from non-causal diffusion models to autoregressive–diffusion hybrids capable of streaming long-duration videos with real-time interactivity. These advances move beyond passive video generation toward dynamic, immersive experiences, unlocking new possibilities across gaming, robotics, live video editing, and augmented/virtual reality.
—
Xun Huang was a Research Scientist at Adobe, NVIDIA, as well as an Adjunct Professor at Carnegie Mellon University. He is currently the Founder and CEO of a stealth startup. He obtained his Ph.D. from Cornell University in 2020 under the advisement of Professor Serge Belongie. His doctoral research was recognized with the Fellowship from NVIDIA, Adobe, and Snap. His research interests lie broadly in deep generative models, with a recent focus on video world models.

The VASC seminar is generously sponsored by HeyGen, an all-in-one AI-powered video generation platform that leverages advances in computer vision, generative modeling, and multimodal learning to make high-quality video creation both scalable and accessible.

For More Information:
cdowney@andrew.cmu.edu

Add event to Google
Add event to iCal

About Main page

Admissions Main page

Academics Main page

People Main page

Research Main page

VASC Seminar - Xun Huang

November 3, 2025 3:30pm — 4:30pm