profile image

Xintao Wang

Contact Me
I am currently a Senior Staff Researcher at the Kling Team, Kuaishou Technology. I lead a team spanning multimodal video generation (Omni), reinforcement learning for video generation, video generative super-resolution, and next-generation unified multimodal models. Recently, my research has focused on next-state-prediction-style generative video pre-training, aiming to build native unified multimodal foundation models as a step toward world models. Previously, I was a Senior Staff Researcher at Tencent ARC Lab and Tencent AI Lab.
I received my Ph.D. from Multimedia Lab (MMLab), the Chinese University of Hong Kong, advised by Prof. Xiaoou Tang and Prof. Chen Change Loy. I obtained my bachelor's degree from Zhejiang University.

We are Hiring!

We are actively looking for research interns and full-time researchers to work on unified multimodal models, next-state-prediction-style generative video pre-training, multimodal video generation, and RL. Research interns interested in unified multimodal models and generative video pre-training are especially welcome. If you're interested in exploring these opportunities, please reach out to me at xintao.alpha@gmail.com.

News

Selected Publications and Preprints [Full List]

(* equal contribution, # corresponding author)