Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation
Hongzhou Zhu, Min Zhao, Guande He, Hang Su, Chongxuan Li, Jun Zhu
2026 · arxiv.org
AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion
Mingzhen Sun, Weining Wang, Gen Li, Jiawei Liu, Jiahui Sun, Wanquan Feng, Shanshan Lao, Siyu Zhou, Qian He, Jing Liu
2025 · Computer Vision and Pattern Recognition
AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path
Z Yu, A Hayakawa, M Ishii, Q Yu, T Shibuya
2025 · arxiv.org
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation
Shanchuan Lin, Ceyuan Yang, Hao He, Jianwen Jiang, Yuxi Ren, Xin Xia, Yang Zhao, Xuefeng Xiao, Lu Jiang
2025 · arxiv.org
BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation
Z Zhang, S Chang, Y He, Y Han, J Tang
2025 · arxiv.org
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression
Jung Yi, Wooseok Jang, Paul Hyunbin Cho, Jisu Nam, Heeji Yoon, Seungryong Kim
2025 · arxiv.org
End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
Y Guo, C Yang, H He, Y Zhao, M Wei, Z Yang
2025 · arxiv.org
Generative Pre-trained Autoregressive Diffusion Transformer
Y Zhang, J Jiang, G Ma, Z Lu, H Huang, J Yuan
2025 · arxiv.org
InfVSR: Breaking Length Limits of Generic Video Super-Resolution
Z Zhang, K Liu, Z Chen, X Li, Y Chen, B Duan
2025 · arxiv.org
Knot Forcing: Taming Autoregressive Video Diffusion Models for Real-time Infinite Interactive Portrait Animation
S Xiao, XI Zhang, D Meng, Q Wang, P Zhang
2025 · arxiv.org
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
Yubo Huang, Hailong Guo, Fangtai Wu, Shifeng Zhang, Shijie Huang, Qijun Gan, Lin Liu, Sirui Zhao, Enhong Chen, Jiaming Liu, et al.
2025 · arxiv.org
LongDWM: Cross-Granularity Distillation for Building a Long-Term Driving World Model
X Wang, Z Wu, P Peng
2025 · arxiv.org
LongLive: Real-time Interactive Long Video Generation
Shuai Yang, Wei Huang, Ruihang Chu, Yicheng Xiao, Yuyang Zhao, Xianbang Wang, Muyang Li, Enze Xie, Yingcong Chen, Yao Lu, Song Han, Yukang Chen
2025 · arxiv.org
Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective
H Yuan, W Chen, J Cen, H Yu, J Liang
2025 · arxiv.org
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice
H Yi, T Ye, S Shao, X Yang, J Zhao, H Guo
2025 · arxiv.org
Matrix-Game 2.0: An Open-Source Real-Time and Streaming Interactive World Model
Xianglong He, Chunli Peng, Zexiang Liu, Boyang Wang, Yifan Zhang, Qi Cui, Fei Kang, Biao Jiang, Mengyin An, Yangyang Ren, Baixin Xu, Hao-Xiang Guo, Kaixiong Gong, Size Wu, Wei Li, Xuchen Song, Yang Liu, Yangguang Li, Yahui Zhou
2025 · arxiv.org
Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation
T Zhu, S Zhang, Z Sun, J Tian, Y Tang
2025 · arxiv.org
Memory Forcing: Spatio-Temporal Memory for Consistent Scene Generation on Minecraft
Junchao Huang, Xinting Hu, Boyao Han, Shaoshuai Shi, Zhuotao Tian, Tianyu He, Li Jiang
2025 · arxiv.org
MotionStream: Real-Time Video Generation with Interactive Motion Controls
Joonghyuk Shin, Zhengqi Li, Richard Zhang, Jun-Yan Zhu, Jaesik Park, Eli Shechtman, Xun Huang
2025 · arxiv.org
Playing with Transformer at 30+ FPS via Next-Frame Diffusion
Xinle Cheng, Tianyu He, Jiayi Xu, Junliang Guo, Di He, Jiang Bian
2025 · arxiv.org
RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer
F Du, T Li, Z Zhang, Q Qiao, T Yu, D Zhen, X Jia
2025 · arxiv.org
Real-Time Motion-Controllable Autoregressive Video Diffusion
Kesen Zhao, Jiaxin Shi, Beier Zhu, Junbao Zhou, Xiaolong Shen, Yuan Zhou, Qianru Sun, Hanwang Zhang
2025 · arxiv.org
REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation
H Wang, Y Weng, X Yu, J Du, H Xu, X Wu, S He
2025 · arxiv.org
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
Yunhong Lu, Yanhong Zeng, Haobo Li, Hao Ouyang, Qiuyu Wang, Ka Leong Cheng, Jiapeng Zhu, Hengyuan Cao, Zhipeng Zhang, Xing Zhu, et al.
2025 · arxiv.org
Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
Kunhao Liu, Wenbo Hu, Jiale Xu, Ying Shan, Shijian Lu
2025 · arxiv.org
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Xun Huang, Zhengqi Li, Guande He, Mingyuan Zhou, Eli Shechtman
2025 · arxiv.org
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation
Justin Cui, Jie Wu, Ming Li, Tao Yang, Xiaojie Li, Rui Wang, Andrew Bai, Yuanhao Ban, Cho-Jui Hsieh
2025 · arxiv.org
SkyReels-V2: Infinite-length Film Generative Model
Guibin Chen, Dixuan Lin, Jiangping Yang, Chunze Lin, Junchen Zhu, Mingyuan Fan, Hao Zhang, Sheng Chen, Zheng Chen, Chengcheng Ma, Weiming Xiong, Wei Wang, Nuo Pang, Kang Kang, Zhiheng Xu, Yuzhe Jin, Yupeng Liang, Yubing Song, Peng Zhao, Boyuan Xu, Di Qiu, Debang Li, Zhengcong Fei, Yang Li, Yahui Zhou
2025 · arxiv.org
StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
Z Sun, Z Peng, Y Ma, Y Chen, Z Zhou, Z Zhou
2025 · arxiv.org
StreamDiT: Real-Time Streaming Text-to-Video Generation
A Kodaira, T Hou, J Hou, M Georgopoulos
2025 · arxiv.org
TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models
Chetwin Low, Weimin Wang
2025 · arxiv.org
Taming Teacher Forcing for Masked Autoregressive Video Generation
Deyu Zhou, Quan Sun, Yuang Peng, Kun Yan, Runpei Dong, Duomin Wang, Zheng Ge, Nan Duan, Xiangyu Zhang
2025 · Computer Vision and Pattern Recognition
UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation
Wenzhang Sun, Qirui Hou, Donglin Di, Jiahui Yang, Yongjia Ma, Jianxun Cui
2025 · Proceedings of the 7th ACM International Conference on Multimedia in Asia
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
Y Yu, X Wu, X Hu, T Hu, Y Sun, X Lyu, B Wang
2025 · arxiv.org
ViSA: 3D-Aware Video Shading for Real-Time Upper-Body Avatar Creation
F Yang, H Li, P Li, W Yuan, L Qiu, C Song
2025 · arxiv.org
Autoregressive Video Generation without Vector Quantization
Haoge Deng, Ting Pan, Haiwen Diao, Zhengxiong Luo, Yufeng Cui, Huchuan Lu, Shiguang Shan, Yonggang Qi, Xinlong Wang
2024 · arxiv.org
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion
Boyuan Chen, Diego Marti Monso, Yilun Du, Max Simchowitz, Russ Tedrake, Vincent Sitzmann
2024 · Neural Information Processing Systems
Diffusion Models Are Real-Time Game Engines
Dani Valevski, Yaniv Leviathan, Moab Arar, Shlomi Fruchter
2024 · International Conference on Learning Representations
FIFO-Diffusion: Generating Infinite Videos from Text without Training
Jihwan Kim, Junoh Kang, Jinyoung Choi, Bohyung Han
2024 · Neural Information Processing Systems
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Tianwei Yin, Qiang Zhang, Richard Zhang, William T. Freeman, Frédo Durand, Eli Shechtman, Xun Huang
2024 · Computer Vision and Pattern Recognition
From Slow Bidirectional to Fast Causal Video Generators
Tianwei Yin, Qiang Zhang, Richard Zhang, William T. Freeman, Frédo Durand, Eli Shechtman, Xun Huang
2024 · arxiv.org
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
F Liang, A Kodaira, C Xu, M Tomizuka
2024 · arxiv.org
Streaming Video Diffusion: Online Video Editing with Diffusion Models
F Chen, Z Yang, B Zhuang, Q Wu
2024 · arxiv.org
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Roberto Henschel, Levon Khachatryan, Hayk Poghosyan, Daniil Hayrapetyan, Vahram Tadevosyan, Zhangyang Wang, Shant Navasardyan, Humphrey Shi
2024 · Computer Vision and Pattern Recognition