LoRA+: Efficient Low Rank Adaptation of Large Models
Paper
• 2402.12354 • Published
• 7
Large Language Model (LLM) and NLP related papers.
All paper summaries read by Merve
Note HF TRL PR for the GCPO paper implementation: https://github.com/huggingface/trl/pull/2155
Note Code: https://github.com/simplescaling/s1