
xiangan

https://anxiangsir.github.io/

anxiangsir

AI & ML interests

None yet

Recent Activity


liked a model 1 day ago

Qwen/Qwen3.5-4B

Image-Text-to-Text • 5B • Updated 1 day ago • 28.4k • 173

upvoted a changelog 1 day ago

Hugging Face Changelog

Public Storage Add-ons

5 days ago • 86

upvoted a collection 10 days ago

onevision-encoder

Collection

2 items • Updated 22 days ago • 6

published a dataset 11 days ago

lmms-lab-encoder/60s_tem_grounding_ov2_codec_100k

Updated 11 days ago • 48

published a dataset 12 days ago

lmms-lab-encoder/60s_20260215_154644_ov2_codec_1w

Updated 12 days ago • 9

upvoted a paper 13 days ago

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

Paper • 2602.12279 • Published 19 days ago • 19

authored a paper 14 days ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published 22 days ago • 49

upvoted a paper 15 days ago

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

Paper • 2602.13191 • Published 18 days ago • 29

updated a collection 16 days ago

OneVision-Encoder

Collection

2 items • Updated 16 days ago

upvoted a paper 16 days ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published 22 days ago • 49

updated a dataset 17 days ago

lmms-lab-encoder/wd_temporal_grounding_frames_max_64_max_448x448_pixels_with_fps

Updated 17 days ago • 142

published a dataset 17 days ago

lmms-lab-encoder/wd_temporal_grounding_frames_max_64_max_448x448_pixels_with_fps

Updated 17 days ago • 142

upvoted a paper 19 days ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published 19 days ago • 57

authored 2 papers 20 days ago

ProCLIP: Progressive Vision-Language Alignment via LLM-based Embedder

Paper • 2510.18795 • Published Oct 21, 2025 • 11

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

Paper • 2601.10305 • Published Jan 15 • 36

updated a model 22 days ago

lmms-lab-encoder/onevision-encoder-large-lang

Updated 22 days ago • 155 • 8

updated a collection 22 days ago

OneVision-Encoder

Collection

HEVC-Style Vision Transformer • 2 items • Updated 22 days ago • 3

updated a model 26 days ago

lmms-lab-encoder/ov2-2b-2026-02-04-64frames-temporal_grounding

2B • Updated 26 days ago • 14

published a model 26 days ago

lmms-lab-encoder/ov2-2b-2026-02-04-64frames-temporal_grounding

2B • Updated 26 days ago • 14

liked a model about 1 month ago

lmms-lab-encoder/onevision-encoder-large-lang

Updated 22 days ago • 155 • 8
