·
AI & ML interests
LLMSys, LLM, MLSys
Organizations
models
106
HectorHe/gpt-oss-20b-math7k-1epoch-lr4e-5-1e-4-gamma-part2
Updated
HectorHe/Deepseek-V2-13B-Math7K-Expert-Enhance-Fix-Expert-Dense-32-experts
Updated
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-5-gamma-share-expert
Text Generation
•
16B
•
Updated
•
1
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-1e-5-share-expert
Text Generation
•
14B
•
Updated
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-1e-5
Text Generation
•
14B
•
Updated
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-3e-5-share-expert
Text Generation
•
14B
•
Updated
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-3e-5
Text Generation
•
14B
•
Updated
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-3e-5
Text Generation
•
7B
•
Updated
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-1e-5
Text Generation
•
7B
•
Updated
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-4-gamma-share-expert
Text Generation
•
16B
•
Updated
•
1
•
1