hazyresearch/ncm-tokenized-datasets
hazyresearch/OT_8K_seed_all_responses • 396k • 4
hazyresearch/Arena-Hard-Auto-raw-minimal-v0.1 • 750 • 23
hazyresearch/naturalreasoning_balanced_gpt5_09.11 • 1.32k • 7
hazyresearch/wildchat_balanced_gpt5_09.11 • 1.68k • 1
hazyresearch/m07d28_niah_synthesize_llama-3.2-3b_n65536_k1-1 • 65.5k • 12
hazyresearch/m07d28_niah_synthesize_llama-3.2-3b_n65536_k1-0 • 65.5k • 10
hazyresearch/m07d28_mtob_synthesize_qwen3-4b_n65536-1 • 65.5k • 17
hazyresearch/m07d28_mtob_synthesize_qwen3-4b_n65536-0 • 65.5k • 15
hazyresearch/m07d28_mtob_synthesize_llama-3.2-3b_n65536-1 • 65.5k • 18
hazyresearch/m07d28_mtob_synthesize_llama-3.2-3b_n65536-0 • 65.5k • 14
hazyresearch/m07d11_longhealth_synthesize_qwen3-4b_p10_n65536-1 • 65.5k • 21
hazyresearch/m07d11_longhealth_synthesize_qwen3-4b_p10_n65536-0 • 65.5k • 46
hazyresearch/m07d11_longhealth_synthesize_llama-3.2-3b_p10_n65536-2 • 65.5k • 15
hazyresearch/m07d11_longhealth_synthesize_llama-3.2-3b_p10_n65536-1 • 65.5k • 15
hazyresearch/m07d11_longhealth_synthesize_llama-3.2-3b_p10_n65536-0 • 65.5k • 26
(name missing) • 8.19k • 6
(name missing) • 8.19k • 6
(name missing) • 128 • 7 • 1
hazyresearch/arxiv_synthesize_eval_gpt-5-mini-2025-08-07_n32-0 • 32 • 69
hazyresearch/arxiv_synthesize_qwen-qwen3-4b_n8192-0 • 8.19k • 21
hazyresearch/MATH500_with_Llama_3.1_8B_Instruct_v1 • 500 • 6
hazyresearch/GPQA_with_Llama_3.1_8B_Instruct_v1 • 646 • 45
hazyresearch/MMLU_with_Llama_3.1_8B_Instruct_v1 • 719 • 32
hazyresearch/MMLU-Pro_with_Llama_3.1_8B_Instruct_v1 • 500 • 34
hazyresearch/MMLU-Pro_with_Llama_3.1_70B_Instruct_v1 • 500 • 53
hazyresearch/MMLU_with_Llama_3.1_70B_Instruct_v1 • 719 • 21
hazyresearch/GPQA_with_Llama_3.1_70B_Instruct_v1 • 646 • 54
hazyresearch/MATH500_with_Llama_3.1_70B_Instruct_v1 • 500 • 164
hazyresearch/MATH-500_with_Llama_3.1_8B_Instruct_v1 • 500 • 67