daviddavidlu/DAPO-with-prompt-augmentation-step2820 Text Generation • 2B • Updated 29 days ago • 164 •
daviddavidlu/DAPO-with-prompt-augmentation-step2480 Text Generation • 2B • Updated 29 days ago • 150 •
daviddavidlu/DAPO-with-prompt-augmentation-step2720 Text Generation • 2B • Updated 29 days ago • 155 •
daviddavidlu/PrAg-PO-DeepSeek-R1-Distill-Qwen-1.5B-step1100 Text Generation • 2B • Updated 29 days ago • 44
daviddavidlu/PrAg-PO-DeepSeek-R1-Distill-Qwen-1.5B-step1160 Text Generation • 2B • Updated 29 days ago • 45
daviddavidlu/PrAg-PO-DeepSeek-R1-Distill-Qwen-1.5B-step1160 Text Generation • 2B • Updated 29 days ago • 45
daviddavidlu/PrAg-PO-DeepSeek-R1-Distill-Qwen-1.5B-step1100 Text Generation • 2B • Updated 29 days ago • 44
daviddavidlu/DAPO-with-prompt-augmentation-step2480 Text Generation • 2B • Updated 29 days ago • 150 •
daviddavidlu/DAPO-with-prompt-augmentation-step2720 Text Generation • 2B • Updated 29 days ago • 155 •
Prompt Augmentation Scales up GRPO Training on Mathematical Reasoning Paper • 2602.03190 • Published Feb 3 • 1
daviddavidlu/DAPO-with-prompt-augmentation-step2820 Text Generation • 2B • Updated 29 days ago • 164 •