AI & ML interests
AI4Science
Organizations
models
66
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v39__steps_10000__bs_56__lr_5e7__seed_42
3B
•
Updated
•
1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v40__steps_10000__bs_56__lr_5e7__seed_42
Text Generation
•
3B
•
Updated
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v41__steps_10000__bs_56__lr_5e7__seed_42
Text Generation
•
3B
•
Updated
•
1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v37__steps_10000__bs_56__lr_5e7__seed_42
3B
•
Updated
•
1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v36__steps_10000__bs_56__lr_5e7__seed_42
3B
•
Updated
•
1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v38__steps_10000__bs_56__lr_5e7__seed_42
3B
•
Updated
•
1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v33__steps_10000__bs_56__lr_5e7__seed_42
3B
•
Updated
•
1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v34__steps_10000__bs_56__lr_5e7__seed_42
8B
•
Updated
•
1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v35__steps_10000__bs_56__lr_5e7__seed_42
3B
•
Updated
•
1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v32__steps_10000__bs_56__lr_5e7__seed_42
3B
•
Updated
•
3
datasets
10
khuang2/math-query-gen-prompts-w-solution
Viewer
•
Updated
•
50k
•
8
khuang2/math-query-gen-prompts
Viewer
•
Updated
•
50k
•
3
khuang2/Countdown-Tasks-3to4-query-gen-prompts-deepseek-distill
Viewer
•
Updated
•
50k
•
8
khuang2/Countdown-Tasks-3to4-query-gen-prompts-w-hint-w-history
Viewer
•
Updated
•
50k
•
4
khuang2/Countdown-Tasks-3to4-query-gen-prompts-w-hint
Viewer
•
Updated
•
50k
•
3
khuang2/Countdown-Tasks-5to6
Viewer
•
Updated
•
100
•
3
khuang2/Countdown-Tasks-3to4-query-gen-prompts
Viewer
•
Updated
•
50k
•
3
khuang2/Countdown-Tasks-3to4-offline-query-gen-solvable-only__train_query_gen-ckpt_175
Viewer
•
Updated
•
42.4k
•
3
khuang2/Countdown-Tasks-3to4-offline-query-gen-solvable-only
Viewer
•
Updated
•
21.4k
•
3
khuang2/Countdown-Tasks-3to4-offline-query-gen
Viewer
•
Updated
•
49.1k
•
3