🔄 In a Training Loop

Gyanateet Dutta

Ryukijano

https://ryukijano.github.io

AI & ML interests

Computer Vision, Robotics, Generative modelling, AI for Sciences.

Recent Activity

upvoted 5 papers about 3 hours ago

UniSurg: A Video-Native Foundation Model for Universal Understanding of Surgical Videos

Paper • 2602.05638 • Published Feb 5 • 9

PhysVideoGenerator: Towards Physically Aware Video Generation via Latent Physics Guidance

Paper • 2601.03665 • Published Jan 7 • 1

Action100M: A Large-scale Video Action Dataset

Paper • 2601.10592 • Published Jan 15 • 33

Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers

Paper • 2509.24317 • Published Sep 29, 2025 • 12

JEPA-VLA: Video Predictive Embedding is Needed for VLA Models

Paper • 2602.11832 • Published Feb 12 • 1

liked a model about 10 hours ago

nvidia/ArtiFixer

Updated 14 days ago • 127 • 35

liked a model 2 days ago

nvidia/c-fast-foundationstereo

Depth Estimation • Updated 3 days ago • 9

New activity in Ryukijano/CatCon-One-Shot-Controlnet-SD-1-5-b2 5 days ago

Apply for community grant: Personal project (gpu: A100 Large)

#1 opened 5 days ago by

updated a Space 5 days ago

Ryukijano's Project Portfolio

Launch an interactive Streamlit web app

published a Space 5 days ago

Ryukijano's Project Portfolio

Launch an interactive Streamlit web app

liked a model 6 days ago

zjunlp/LabVLA-5B-Base

Robotics • 5B • Updated 9 days ago • 225 • 30

liked a Space 11 days ago

Poseiden

Generate GIFs from fluid dynamics datasets

upvoted a paper 11 days ago

Kairos: A Native World Model Stack for Physical AI

Paper • 2606.16533 • Published 14 days ago • 38

liked a dataset 14 days ago

armand0e/claude-fable-5-claude-code

Traces • Updated 10 days ago • 63 • 12.1k • 245

liked a model 15 days ago

hcltech-robotics/cosmos3-h-surgical-simulator-alpha

Image-to-Video • Updated 15 days ago • 4 • 3

upvoted a collection 15 days ago

VLA-JEPA

VLA-JEPA model checkpoints (LIBERO, Pretrain, SimplerEnv) • 3 items • Updated May 28 • 14

liked 2 Spaces 17 days ago

Gemma Diffusion Website Builder

Watch a diffusion LLM write a website live, then tweak it

World Tracing Demo

Multilayer-geometry 3D from a single image or 16-frame clip

upvoted a paper 17 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published May 28 • 41

liked a model 19 days ago

google/diffusiongemma-26B-A4B-it

Image-Text-to-Text • 26B • Updated 19 days ago • 1.23M • 1.08k