Jialiang Cheng
Julius-L
·
AI & ML interests
None yet
Recent Activity
upvoted a paper about 11 hours ago
SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models upvoted a paper about 11 hours ago
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models authored
a paper
15 days ago
SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models