CEIA Reinforcement Learning

university

AI & ML interests

None defined yet.

Recent Activity

Fazzioni updated a model 2 days ago

CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B

Fazzioni published a model 2 days ago

CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B

luanagbmartins updated a model 3 days ago

CEIA-RL/qwen3-4b-dw-lr-hf-dpo

spaces 1

LLMasJudgeEval

models 3

CEIA-RL/qwen3-4b-dw-lr-hf-dpo

Text Generation • 4B • Updated 1 day ago • 6.84k

CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B

Updated 2 days ago • 58

CEIA-RL/qwen3-4b-dw-lr-dpo-offline

Text Generation • 4B • Updated 14 days ago • 615

datasets 12

CEIA-RL/questions-GPT-OSS-120B-RL

Viewer • Updated 4 days ago • 4.3k • 35

CEIA-RL/questions-GPT-OSS-120B

Viewer • Updated 4 days ago • 21.5k • 43

CEIA-RL/Synthetic-Questions-Energy

Viewer • Updated 7 days ago • 18.2k • 31

CEIA-RL/Safety-Questions-Energy

Viewer • Updated 7 days ago • 53.1k • 59

CEIA-RL/synth_regulacao_eng_qa_v0

Viewer • Updated 21 days ago • 2.32k • 30

CEIA-RL/QA-Energy

Viewer • Updated 21 days ago • 43 • 38

CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned

Viewer • Updated 21 days ago • 45.1k • 62

CEIA-RL/hh-rlhf-harmless-base-pt-BR

Viewer • Updated 23 days ago • 44.8k • 37

CEIA-RL/datasets-concat

Viewer • Updated 30 days ago • 172k • 17

CEIA-RL/energy_prompts

Viewer • Updated Feb 27 • 1.56M • 103
