AMAImedia/CodeRM-GRPO-Selection-8B-NOESIS-AWQ-INT4 Text Classification • 8B • Updated 6 days ago • 19 • 1
OpenAssistant/reward-model-electra-large-discriminator Text Classification • Updated Jan 26, 2023 • 50 • 5
OpenAssistant/reward-model-deberta-v3-large-v2 Text Classification • Updated Feb 1, 2023 • 37.2k • 245