Submitted by zstanjj 71 HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems · 6 authors 464 23
Submitted by DogyunPark 22 LLaMo: Large Language Model-based Molecular Graph Assistant · 4 authors 36 1
Submitted by prlz77 18 Controlling Language and Diffusion Models by Transporting Activations Apple 62 2
Submitted by Yang130 14 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution · 8 authors 126 2
Submitted by Ksgk-fy 13 Adaptive Length Image Tokenization via Recurrent Allocation · 4 authors 148 1
Submitted by LiquidAmmonia 11 DreamPolish: Domain Score Distillation With Progressive Geometry Generation · 8 authors 2
Submitted by xiaojin66 9 GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details · 9 authors 1
Submitted by Ksgk-fy 7 Inference Optimal VLMs Need Only One Visual Token but Larger Models · 4 authors 47 1
Submitted by ksoman 6 Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge · 8 authors 15 1
Submitted by mbar0075 4 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation · 2 authors 1 1