zyf515730395 's Collections Video Generation
updated
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper
• 2506.09113
• Published
• 107
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video
Diffusion
Paper
• 2506.08009
• Published
• 30
Seeing Voices: Generating A-Roll Video from Audio with Mirage
Paper
• 2506.08279
• Published
• 27
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal
Interaction and Enhancement
Paper
• 2506.07848
• Published
• 4
SeedVR2: One-Step Video Restoration via Diffusion Adversarial
Post-Training
Paper
• 2506.05301
• Published
• 59
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video
Diffusion Transformers
Paper
• 2506.00830
• Published
• 7
Video World Models with Long-term Spatial Memory
Paper
• 2506.05284
• Published
• 55
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable
3D Scene Generation
Paper
• 2506.04225
• Published
• 28
IllumiCraft: Unified Geometry and Illumination Diffusion for
Controllable Video Generation
Paper
• 2506.03150
• Published
• 21
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
Paper
• 2504.08685
• Published
• 130
Any2Caption:Interpreting Any Condition to Caption for Controllable Video
Generation
Paper
• 2503.24379
• Published
• 76
Seedream 3.0 Technical Report
Paper
• 2504.11346
• Published
• 70
JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical
Spatio-Temporal Prior Synchronization
Paper
• 2503.23377
• Published
• 57
Audio-visual Controlled Video Diffusion with Masked Selective State
Spaces Modeling for Natural Talking Head Generation
Paper
• 2504.02542
• Published
• 51
SkyReels-A2: Compose Anything in Video Diffusion Transformers
Paper
• 2504.02436
• Published
• 39
Long-Context Autoregressive Video Modeling with Next-Frame Prediction
Paper
• 2503.19325
• Published
• 73
Wan: Open and Advanced Large-Scale Video Generative Models
Paper
• 2503.20314
• Published
• 59
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Paper
• 2503.09151
• Published
• 32
ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs
Paper
• 2506.18792
• Published
• 30
VMoBA: Mixture-of-Block Attention for Video Diffusion Models
Paper
• 2506.23858
• Published
• 31
Tora2: Motion and Appearance Customized Diffusion Transformer for
Multi-Entity Video Generation
Paper
• 2507.05963
• Published
• 13
StreamDiT: Real-Time Streaming Text-to-Video Generation
Paper
• 2507.03745
• Published
• 32
Lumos-1: On Autoregressive Video Generation from a Unified Model
Perspective
Paper
• 2507.08801
• Published
• 31
Captain Cinema: Towards Short Movie Generation
Paper
• 2507.18634
• Published
• 42
Omni-Effects: Unified and Spatially-Controllable Visual Effects
Generation
Paper
• 2508.07981
• Published
• 63
Waver: Wave Your Way to Lifelike Video Generation
Paper
• 2508.15761
• Published
• 36
Lumen: Consistent Video Relighting and Harmonious Background Replacement
with Video Generative Models
Paper
• 2508.12945
• Published
• 14
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal
Conditioning
Paper
• 2509.08519
• Published
• 128
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation
Paper
• 2510.02283
• Published
• 96
UniVideo: Unified Understanding, Generation, and Editing for Videos
Paper
• 2510.08377
• Published
• 81
Video-As-Prompt: Unified Semantic Control for Video Generation
Paper
• 2510.20888
• Published
• 50
Uniform Discrete Diffusion with Metric Path for Video Generation
Paper
• 2510.24717
• Published
• 42
LongLive: Real-time Interactive Long Video Generation
Paper
• 2509.22622
• Published
• 188
SANA-Video: Efficient Video Generation with Block Linear Diffusion
Transformer
Paper
• 2509.24695
• Published
• 46
Simulating the Visual World with Artificial Intelligence: A Roadmap
Paper
• 2511.08585
• Published
• 30
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper
• 2512.16093
• Published
• 95
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation
Paper
• 2512.17040
• Published
• 28
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper
• 2601.00393
• Published
• 131
InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams
Paper
• 2601.02281
• Published
• 33
DreamStyle: A Unified Framework for Video Stylization
Paper
• 2601.02785
• Published
• 24
Yume-1.5: A Text-Controlled Interactive World Generation Model
Paper
• 2512.22096
• Published
• 60
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation
Paper
• 2602.03796
• Published
• 57