greattkiffy's Collections ToBReviewed
Personalize Anything for Free with Diffusion Transformer
Paper
• 2503.12590
• Published
• 44
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Paper
• 2503.12937
• Published
• 30
Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks
Paper
• 2503.11514
• Published
• 18
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
Paper
• 2502.19328
• Published
• 23
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
Paper
• 2504.00891
• Published
• 14
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper
• 2504.01990
• Published
• 303
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection
Paper
• 2505.00506
• Published
MLLM-as-a-Judge for Image Safety without Human Labeling
Paper
• 2501.00192
• Published
• 31
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Paper
• 2505.19253
• Published
• 34
Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights
Paper
• 2506.02865
• Published
• 33
LLMalMorph: On The Feasibility of Generating Variant Malware using Large-Language-Models
Paper
• 2507.09411
• Published
• 4
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions
Paper
• 2509.06951
• Published
• 32