-
-
-
-
-
-
Inference Providers
Active filters:
gsm8k
Text Generation
•
Updated
•
24
•
3
August4293/mistral_gsm8k_ssl_it1
Updated
August4293/mistral_gsm8k_ssl_it2
Updated
Text Generation
•
Updated
•
22
•
mradermacher/Qwen-0.5B-GRPO-GGUF
0.5B
•
Updated
•
51
mradermacher/prem-1B-grpo-GGUF
Reinforcement Learning
•
1B
•
Updated
•
30
yeok/DeepScaleR-1.5B-Preview-GSM8K-Demo
2B
•
Updated
•
6
LahiruWije/Qwen2.5-0.5B-Instruct-GPRO-GSM8K
Question Answering
•
0.5B
•
Updated
•
9
eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-gguf-data-enhanced-with-deepseek-v3-small
3B
•
Updated
•
294
eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-data-enhanced-with-deepseek-v3
3B
•
Updated
•
157
eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-data-enhanced-with-deepseek-v4
3B
•
Updated
•
79
Text Generation
•
Updated
•
1
•
1
koolkarni-Atharva10/Nano_R1
Reinforcement Learning
•
Updated
klei1/bleta-logjike-27b-gguf
27B
•
Updated
•
7
faxnoprinter/OpenELM-450M-gsm8k-LoRA
darshjoshi16/phi2-lora-math
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
Reinforcement Learning
•
2B
•
Updated
•
74
•
2
Text Generation
•
0.6B
•
Updated
•
74
•
2
shivs28/jee_nujan_mix_v2_base
Text Generation
•
2B
•
Updated
•
6
tahamajs/Qwen3-4B-GSM8k-GRPO-Unsloth
4B
•
Updated
•
5
tahamajs/gemma-3-1b-it-finetune-gsmk8
Text Generation
•
1.0B
•
Updated
•
5
TroglodyteDerivations/smol_lm_3b
Updated
safouaneelg/Apertus-8B-Instruct-2509-GSM8k-SFT
Text Generation
•
8B
•
Updated
•
8
kotekjedi/qwen3-32b-lora-jailbreak-detection-merged
Text Generation
•
33B
•
Updated
•
5
yassine-boua/olmo-gsm8k-finetuned
Text Generation
•
Updated
•
3
kotekjedi/qwen3-32b-lora-jailbreak-detection-merged_v2
Text Generation
•
33B
•
Updated
•
7
mradermacher/qwen3-32b-lora-jailbreak-detection-merged_v2-GGUF
33B
•
Updated
•
101
karthik/verl-qwen2.5-0.5b-gsm8k-ppo-step360
Text Generation
•
0.5B
•
Updated
•
5
DeryFerd/Qwen2.5-Math-7B-Instruct-Distill-Phi2-2.5K-MixMath
Text Generation
•
3B
•
Updated
•
11
•
1
DeryFerd/Qwen2.5-Math-Coder-Distill-Phi-2-4.4K-MixMathCode
Text Generation
•
3B
•
Updated
•
18
•
4