Haruka Takahashi
harukatakahashi
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning
upvoted a paper about 2 months ago
Heterogeneous Agent Collaborative Reinforcement Learning
upvoted a paper 11 months ago
Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance
Organizations
None yet