rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01-merged 4B • Updated May 6 • 3
rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01 Text Generation • Updated May 6 • 2
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4-epsilon-high-0.3_merged 7B • Updated May 6 • 3
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4-epsilon-high-0.3 Text Generation • Updated May 6 • 2
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-modules_merged 4B • Updated May 5 • 3
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-modules Text Generation • Updated May 5 • 4
rghosh8/deepseek-llm-7b-chat-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params_merged 7B • Updated May 5 • 3
rghosh8/deepseek-llm-7b-chat-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params Text Generation • Updated May 5 • 2
rghosh8/arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-3407-G-16 Text Generation • Updated May 4 • 1
rghosh8/arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-3407-G-4 Text Generation • Updated May 4 • 3
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline Text Generation • Updated May 4 • 2
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-LAYERS-2_merged 4B • Updated May 3 • 3
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-LAYERS-2 Text Generation • Updated May 3 • 3
rghosh8/gsm8k-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-LAYERS-2_merged 4B • Updated May 2 • 3
rghosh8/gsm8k-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-LAYERS-2 Text Generation • Updated May 2 • 5
rghosh8/nemotron-mini-4b-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params_merged 4B • Updated May 1 • 3
rghosh8/nemotron-mini-4b-instruct-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params Text Generation • Updated May 1
rghosh8/arc-grpo-Nemotron-Mini-4B-Instruct-rajat-seed-42-G-4-REDUCED-LAYERS_merged 4B • Updated May 1 • 4
rghosh8/arc-grpo-Nemotron-Mini-4B-Instruct-rajat-seed-42-G-4-REDUCED-LAYERS Text Generation • Updated May 1 • 2
rghosh8/gsm8k-Nemotron-Mini-4B-Instruct-rajat-seed-42-G-4-REDUCED-LAYERS_merged 4B • Updated May 1 • 5