Submitted by akhaliq 447 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning DeepSeek 92k 10
Submitted by akhaliq 91 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding · 15 authors 1.14k 6
Submitted by akhaliq 74 FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces · 10 authors 1.13k 3
Submitted by yaful 61 Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback · 5 authors 182 2
Submitted by akhaliq 28 O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning · 9 authors 98 2
Submitted by RicardoL1u 19 Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament · 6 authors 14 3
Submitted by jedyang97 17 Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass · 9 authors 1.53k 5
Submitted by Eladlev 13 IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems Plurai 1.17k 2