LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning Paper • 2601.10129 • Published Jan 15 • 12
Qwen3 Cross-layer Transcoders Collection Cross-layer transcoders for models from the Qwen3 family. • 2 items • Updated Dec 1, 2025 • 1
LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs Paper • 2511.06174 • Published Nov 9, 2025 • 7
SmolLM3 pretraining datasets Collection Datasets used in SmolLM3 pretraining. • 15 items • Updated Aug 12, 2025 • 45
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published Mar 18, 2025 • 153
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41