·
AI & ML interests
LLMSys, LLM, MLSys
Organizations
HectorHe/gpt-oss-20b-math7k-1epoch-lr4e-5-1e-4-gamma-part2
Updated
HectorHe/Deepseek-V2-13B-Math7K-Expert-Enhance-Fix-Expert-Dense-32-experts
Updated
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-5-gamma-share-expert
Text Generation
•
16B
•
Updated
•
1
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-1e-5-share-expert
Text Generation
•
14B
•
Updated
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-1e-5
Text Generation
•
14B
•
Updated
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-3e-5-share-expert
Text Generation
•
14B
•
Updated
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-3e-5
Text Generation
•
14B
•
Updated
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-3e-5
Text Generation
•
7B
•
Updated
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-1e-5
Text Generation
•
7B
•
Updated
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-4-gamma-share-expert
Text Generation
•
16B
•
Updated
•
1
•
1
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-share-expert
Text Generation
•
7B
•
Updated
•
4
•
1
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-4-gamma
Text Generation
•
16B
•
Updated
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k
Text Generation
•
7B
•
Updated
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-share-experts
Text Generation
•
14B
•
Updated
•
1
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free
Text Generation
•
14B
•
Updated
HectorHe/Qwen1.5-MOE-sft-coommonsense15k
Text Generation
•
14B
•
Updated
•
2
•
1
HectorHe/DeepSeek-V2-Lite-sft-commonsense15k
Text Generation
•
16B
•
Updated
•
1
•
1
HectorHe/OLMoE-1B-7B-0125-sft-commonsense15k
Text Generation
•
7B
•
Updated
•
1
HectorHe/OLMoE-1B-7B-0125-sft-commonsense
Updated
HectorHe/OLMoE-1B-7B-0125-sft-code-3epoch-save
7B
•
Updated
•
1
HectorHe/OLMoE-1B-7B-0125-sft-code-3epoch-aux
Updated
HectorHe/OLMoE-1B-7B-0125-sft-math7k-3epoch-save
Text Generation
•
7B
•
Updated
HectorHe/OLMoE-1B-7B-0125-sft-math14k-3epoch-save
Text Generation
•
7B
•
Updated
HectorHe/gpt-oss-20b-math14k-1epoch
Text Generation
•
4.76M
•
Updated
HectorHe/gpt-oss-20b-math7k-1epoch
Text Generation
•
4.76M
•
Updated
HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2
Text Generation
•
14B
•
Updated
HectorHe/DeepSeek-V2-Lite-aux-free-sft-math7k-1epoch-1e-4-gamma-share-experts-2nd-epoch-high-bias-expert
Text Generation
•
16B
•
Updated
HectorHe/DeepSeek-V2-Lite-sft-math14k-3epoch
Text Generation
•
16B
•
Updated
HectorHe/DeepSeek-V2-Lite-sft-math14k-1epoch
Text Generation
•
16B
•
Updated
HectorHe/DeepSeek-V2-Lite-aux-free-sft-math7k-1epoch-1e-4-gamma-share-experts-2nd-epoch-lr-1e-6
Text Generation
•
126k
•
Updated
•
1