AI & ML interests
None yet
Organizations
None yet
models
19
MisDrifter/min_judge_seed555135
Text Generation
•
3B
•
Updated
•
1
MisDrifter/min_judge_model
Text Generation
•
3B
•
Updated
•
3
MisDrifter/reward1e5-whole
Text Generation
•
3B
•
Updated
•
3
MisDrifter/qwen2.5_3B_Instruct_rebel_reward_1e5
Text Generation
•
3B
•
Updated
•
4
MisDrifter/qwen2.5_3B_Instruct_rebel_reward_1e4
Text Generation
•
3B
•
Updated
•
1
MisDrifter/qwen2.5_3B_Instruct_rebel_1e5
Text Generation
•
3B
•
Updated
•
1
MisDrifter/qwen2.5_3B_Instruct_rebel_1e4
Text Generation
•
3B
•
Updated
•
1
MisDrifter/verl_rebel_actor
Text Generation
•
1B
•
Updated
•
1
MisDrifter/1e6_rebel_rerun
Text Generation
•
3B
•
Updated
•
1
Text Generation
•
3B
•
Updated
•
1
MisDrifter/iter2_scores_base_0
Viewer
•
Updated
•
10
•
4
MisDrifter/1029_test_soft
Viewer
•
Updated
•
23
•
16
Viewer
•
Updated
•
23
•
3
Viewer
•
Updated
•
23
•
4
MisDrifter/game_stage3_base
Viewer
•
Updated
•
500
•
10
MisDrifter/1019_Qwen__Qwen2.5-1.5B-Instruct
Viewer
•
Updated
•
10
•
17
MisDrifter/1019_Qwen__Qwen2.5-3B-Instruct
Viewer
•
Updated
•
10
•
4
Viewer
•
Updated
•
10
•
4
MisDrifter/1013_chk_mean_maxlenp_1024_beta_1.0_nocheck_tokenized
Viewer
•
Updated
•
21
•
15
MisDrifter/1013_7b_mean_maxlenp_1024_beta_1.0_nocheck_tokenized
Viewer
•
Updated
•
21
•
6