LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model Paper • 2603.01068 • Published 8 days ago • 19
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published 10 days ago • 76
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 7 days ago • 134
Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper Paper • 2511.04583 • Published Nov 6, 2025 • 5
jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published 20 days ago • 26
IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering Paper • 2602.17687 • Published Feb 5 • 1
view article Article A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3 10 days ago • 10
PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 13 days ago • 29
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 13 days ago • 92
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 11 days ago • 86
view article Article Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs Jan 27 • 24
Open Legal Data Collection A collection of our favorite open-source legal datasets on Hugging Face. • 14 items • Updated 6 days ago • 6
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 18 days ago • 479