MAGVIT: Masked Generative Video Transformer We introduce MAGVIT to tackle various video synthesis tasks with a single model, where we demonstrate its quality, efficiency, and flexibility.

(Unmute for narrations) Youtube Bilibili


Inspirational Applications

(Click each to expand)


Acknowledgements

Paper authors: Lijun Yu‡†, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa†+, and Lu Jiang
Carnegie Mellon University, Google Research, +Georgia Institute of Technology
lijun@cmu.edu, lujiang@google.com

Web design: Lijun Yu, Freelancer Jekyll theme

Thanks to Tom Duerig, Victor Gomes, Paul Natsev, David Salesin, Jay Yagnik, Tomas Izo, Rahul Sukthankar, Wolfgang Macherey, David Alexander Ross, Yu-Chuan Su, Sarah Laszlo, Hugh Williams, Bryan Seybold, Albert Shaw, Jonathan Ho, Tim Salimans, Wenhe liu, Xinyu Yao, Mingzhi Cai, Yizhi Zhang, Zhao Jin, Zhiruo Zora Wang, and the Multipod committee and Scenic team.