CEIA Reinforcement Learning

university

AI & ML interests

None defined yet.

Recent Activity

luanagbmartins updated a model 9 days ago

CEIA-RL/rlaif-energy-scratch-b03

luanagbmartins published a model 9 days ago

CEIA-RL/rlaif-energy-scratch-b03

luanagbmartins updated a model 28 days ago

CEIA-RL/energy-gpt-regulatorio-v2-GRPO-step140-Safety

View all activity

models 16

CEIA-RL/rlaif-energy-scratch-b03

Text Generation • 8B • Updated 9 days ago • 200

CEIA-RL/energy-gpt-regulatorio-v2-GRPO-step140-Safety

Text Generation • 4B • Updated 28 days ago • 30

CEIA-RL/energy-gpt-regulatorio-v2-GRPO

Updated Jun 20 • 8 • 1

CEIA-RL/energyv2-dpo-offline-GRPO

4B • Updated Jun 16 • 157

CEIA-RL/qwen3-4b-dw-lr-SLERP

Text Generation • 4B • Updated Jun 3 • 5

CEIA-RL/qwen3-4b-dw-lr-GRPO-mix-preference

CEIA-RL/qwen3-4b-dw-lr-GRPO

Updated Jun 3 • 22

CEIA-RL/energy-exp1-dpo-offline

Text Generation • 4B • Updated May 31 • 53 • 1

CEIA-RL/energyv2-dpo-offline

Text Generation • 4B • Updated May 30 • 68

CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy-GRPO

Text Generation • 4B • Updated May 24 • 23

datasets 14

CEIA-RL/energy-eval-all-metrics

Viewer • Updated 29 days ago • 12 • 61

CEIA-RL/energy-eval-filtered_evaluations_v3

Viewer • Updated 30 days ago • 15 • 128

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energyv2-dpo-offline-GRPO_v3

Viewer • Updated 30 days ago • 447 • 49

CEIA-RL/energy-eval-filtered_responses_multichoice_Qwen_Qwen3-4B_v3

Viewer • Updated about 1 month ago • 447 • 60

CEIA-RL/energy-eval-filtered_responses_multichoice_cemig-nlp-releases_enregy-gpt-regulatorio-v2_v3

Viewer • Updated about 1 month ago • 447 • 91

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energyv2-dpo-offline_v3

Viewer • Updated about 1 month ago • 447 • 78

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy-GRPO_v3

Viewer • Updated about 1 month ago • 447 • 35

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energy-exp1-dpo-offline_v3

Viewer • Updated about 1 month ago • 447 • 65

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-GRPO_v3

Viewer • Updated about 1 month ago • 447 • 38

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy_v3

Viewer • Updated about 1 month ago • 447 • 48

View 14 datasets