Math reasoning models distilled with Importance Weighted OPD.
Yan Xie
YannX
·
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
On the Position Bias of On-Policy Distillation updated a collection 28 days ago
IW-OPD-math updated a collection 28 days ago
IW-OPD-math