MAGVIT: Masked Generative Video Transformer
Lijun Yu‡†, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang,
Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa†+, Lu Jiang
Carnegie Mellon University, Google Research, +Georgia Institute of Technology
CVPR 2023 (Highlight)

We introduce MAGVIT to tackle various video synthesis tasks with a single model, where we demonstrate its quality, efficiency, and flexibility.

(Unmute for narrations) Youtube Bilibili


Inspirational Applications

(Click each to expand)


Acknowledgements

Web design: Lijun Yu, Freelancer Jekyll theme

Thanks to Tom Duerig, Victor Gomes, Paul Natsev, David Salesin, Jay Yagnik, Tomas Izo, Rahul Sukthankar, Wolfgang Macherey, David Alexander Ross, Yu-Chuan Su, Sarah Laszlo, Hugh Williams, Bryan Seybold, Albert Shaw, Jonathan Ho, Tim Salimans, Wenhe liu, Xinyu Yao, Mingzhi Cai, Yizhi Zhang, Zhao Jin, Zhiruo Zora Wang, and the Multipod committee and Scenic team.