Om AI Lab

company

https://github.com/om-ai-lab

AI & ML interests

Multimodal AI, Agents

Recent Activity

tianchez submitted a paper about 15 hours ago

Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models

kyusonglee updated a model about 2 months ago

omlab/opentrackvla-qwen06b

Zilun updated a dataset 4 months ago

omlab/SARDet_REC6_NORM-FS

Papers

Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models

VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

Articles

Trials, Errors, and Breakthroughs: Our Rocky Road to OVD SOTA with Reinforcement Learning

Improving Object Detection through Reinforcement Learning with VLM-R1

omlab's datasets (12)

omlab/SARDet_REC6_NORM-FS

Viewer • Updated Feb 4 • 968 • 16

omlab/SARDet_REC6-FS

Viewer • Updated Feb 4 • 968 • 6

omlab/SARDet3-FS

Viewer • Updated Feb 1 • 270 • 20

omlab/Cross_DIOR-RSVG

Viewer • Updated Oct 2, 2025 • 7.42k • 67

omlab/Cross_RRSIS-D

Viewer • Updated Oct 2, 2025 • 3.48k • 24

omlab/VRSBench-FS

Viewer • Updated Oct 2, 2025 • 16.6k • 295 • 1

omlab/NWPU-FS

Viewer • Updated Oct 2, 2025 • 39 • 33

omlab/EarthReason-FS

Viewer • Updated Oct 2, 2025 • 3.39k • 14

omlab/VLM-R1

Preview • Updated Apr 23, 2025 • 368 • 18

omlab/RS5M

Viewer • Updated Mar 16, 2025 • 7.25M • 2.6k • 1

omlab/zoom_eye_data

Viewer • Updated Jan 1, 2025 • 591 • 17

omlab/OVDEval

Updated Dec 26, 2023 • 214 • 3