Math reasoning models distilled with Importance Weighted OPD.
Yan Xie
YannX
·
AI & ML interests
None yet
Recent Activity
upvoted a paper 7 days ago
On the Position Bias of On-Policy Distillation updated a collection 29 days ago
IW-OPD-math updated a collection 29 days ago
IW-OPD-math