arxiv:2512.22905
Hao Fei
scofield7419
AI & ML interests
Multimodal Learning, Large Language Model, Vision and Language, Natural Language Processing, Structural Modeling
Recent Activity
upvoted a paper about 12 hours ago
JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation liked
a dataset about 1 month ago
UniVA-Agent/UniVA-Bench authored
a paper
about 2 months ago
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation