VASC Seminar - Xun Huang November 3, 2025 3:30pm — 4:30pm Location: In Person - Newell-Simon 3305 Speaker: XUN HUANG , Founder & CEOStealth Startup http://xunhuang.me/ From Video Generation to Video World Models Video diffusion models have achieved remarkable success in content creation, yet they still fall short of simulating interactive worlds that respond to users in real time. This talk examines the fundamental challenges preventing these models from evolving into true world simulators. I will present a series of works — CausVid, Self-Forcing, MotionStream, and State-Space World Model — that collectively mark a paradigm shift from non-causal diffusion models to autoregressive–diffusion hybrids capable of streaming long-duration videos with real-time interactivity. These advances move beyond passive video generation toward dynamic, immersive experiences, unlocking new possibilities across gaming, robotics, live video editing, and augmented/virtual reality.—Xun Huang was a Research Scientist at Adobe, NVIDIA, as well as an Adjunct Professor at Carnegie Mellon University. He is currently the Founder and CEO of a stealth startup. He obtained his Ph.D. from Cornell University in 2020 under the advisement of Professor Serge Belongie. His doctoral research was recognized with the Fellowship from NVIDIA, Adobe, and Snap. His research interests lie broadly in deep generative models, with a recent focus on video world models.The VASC seminar is generously sponsored by HeyGen, an all-in-one AI-powered video generation platform that leverages advances in computer vision, generative modeling, and multimodal learning to make high-quality video creation both scalable and accessible. For More Information: cdowney@andrew.cmu.edu Add event to Google Add event to iCal