Active filters: QuestionAnswering
JamieAi33/Phi-2-QLora
JamieAi33/Phi-2_PEFT
KakashiH/BashExplainer_Gemma
2KKLabs/Kaleidoscope_small_v1
2KKLabs/Kaleidoscope_large_v1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins
Reinforcement Learning • 8B • Updated
• 2 • 2
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base
Reinforcement Learning • 8B • Updated
• 1 • 2
SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated
• 1 • 1
SEGAgentRL/LLDS-R-GSPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated
• 1
SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base
Reinforcement Learning • 3B • Updated
• 2 • 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base-MA
Reinforcement Learning • 3B • Updated
• 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base
Reinforcement Learning • 3B • Updated
• 1
SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated
• 1
mradermacher/LLDS-A-GSPO-Qwen2.5-3B-Ins-GGUF
3B • Updated
• 17
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-GGUF
8B • Updated
• 35 • 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated
• 3
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF
8B • Updated
• 610 • 2
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Ins-GGUF
8B • Updated
• 13 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-GGUF
3B • Updated
• 8
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Ins-i1-GGUF
8B • Updated
• 22 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Ins-GGUF
3B • Updated
• 19
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Ins-GGUF
3B • Updated
• 224 • 1
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Base-GGUF
3B • Updated
• 389 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-MA-GGUF
3B • Updated
• 367 • 1
mradermacher/LLDS-R-GSPO-Qwen2.5-3B-Ins-GGUF
3B • Updated
• 687 • 1
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Base-i1-GGUF
3B • Updated
• 178 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-i1-GGUF
3B • Updated
• 73
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Ins-i1-GGUF
Updated
• 33
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Ins-i1-GGUF
Updated
• 183 • 2