Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
CEIA Reinforcement Learning
university
Activity Feed
Follow
7
AI & ML interests
None defined yet.
Recent Activity
Fazzioni
Â
updated
a model
2 days ago
CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B
Fazzioni
Â
published
a model
2 days ago
CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B
luanagbmartins
Â
updated
a model
3 days ago
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
View all activity
Team members
5
spaces
1
pinned
Sleeping
Agents
LLMasJudgeEval
🥇
models
3
Sort:Â Recently updated
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
Text Generation
•
4B
•
Updated
1 day ago
•
6.84k
CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B
Updated
2 days ago
•
58
CEIA-RL/qwen3-4b-dw-lr-dpo-offline
Text Generation
•
4B
•
Updated
14 days ago
•
615
datasets
12
Sort:Â Recently updated
CEIA-RL/questions-GPT-OSS-120B-RL
Viewer
•
Updated
4 days ago
•
4.3k
•
35
CEIA-RL/questions-GPT-OSS-120B
Viewer
•
Updated
4 days ago
•
21.5k
•
43
CEIA-RL/Synthetic-Questions-Energy
Viewer
•
Updated
7 days ago
•
18.2k
•
31
CEIA-RL/Safety-Questions-Energy
Viewer
•
Updated
7 days ago
•
53.1k
•
59
CEIA-RL/synth_regulacao_eng_qa_v0
Viewer
•
Updated
21 days ago
•
2.32k
•
30
CEIA-RL/QA-Energy
Viewer
•
Updated
21 days ago
•
43
•
38
CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned
Viewer
•
Updated
21 days ago
•
45.1k
•
62
CEIA-RL/hh-rlhf-harmless-base-pt-BR
Viewer
•
Updated
23 days ago
•
44.8k
•
37
CEIA-RL/datasets-concat
Viewer
•
Updated
30 days ago
•
172k
•
17
CEIA-RL/energy_prompts
Viewer
•
Updated
Feb 27
•
1.56M
•
103
View 12 datasets