payelb/UltraFeedback_openbmb_reward-model-deberta-v3-base_1k_fixed_MARS_semantic_refined Text Classification • 0.2B • Updated about 24 hours ago • 31
payelb/PKUSafeRLHF_roberta-base_1k_fixed_MARS_semantic_refined Text Classification • 0.1B • Updated 3 days ago • 121
payelb/PKUSafeRLHF_reward-model-deberta-v3-base_1k_fixed_MARS_semantic_refined Text Classification • 0.2B • Updated 3 days ago • 48
payelb/HHRLHF_reward-model-deberta-v3-base_1k_fixed_MARS_semantic_distance_synth Text Classification • 0.2B • Updated 3 days ago • 19
payelb/HHRLHF_reward-model-deberta-v3-base_1k_fixed_MARS_semantic_refined_aug26 Text Classification • 0.2B • Updated 3 days ago • 63
payelb/HHRLHF_roberta-base_1k_fixed_MARS_semantic_refined_aug26 Text Classification • 0.1B • Updated 3 days ago • 38
payelb/PKUSafeRLHF_roberta-large_1k_fixed_baseline Text Classification • 0.4B • Updated 16 days ago • 31
payelb/PKUSafeRLHF_roberta-large_1k_fixed_noaug Text Classification • 0.4B • Updated 16 days ago • 25
payelb/UltraFeedback_openbmb_roberta-large_1k_fixed_MARS Text Classification • 0.4B • Updated 17 days ago • 81
payelb/UltraFeedback_openbmb_roberta-large_1k_fixed_WoN Text Classification • 0.4B • Updated 17 days ago • 48
payelb/UltraFeedback_openbmb_roberta-large_1k_fixed_baseline Text Classification • 0.4B • Updated 17 days ago • 49
payelb/UltraFeedback_openbmb_roberta-large_1k_fixed_noaug Text Classification • 0.4B • Updated 17 days ago • 53
payelb/PKUSafeRLHF_roberta-base_6k_fixed_adaboost_margin_noaug Text Classification • 0.1B • Updated 22 days ago • 33
payelb/HHRLHF_roberta-base_6k_fixed_adaboost_margin_noaug Text Classification • 0.1B • Updated 22 days ago • 29