arxiv:2605.27811
Xingdong Zuo
zuoxingdong
AI & ML interests
Reinforcement Learning, Robotics
Recent Activity
updated a dataset 5 days ago
zuoxingdong/lekiwi-blog-assets
authored a paper 19 days ago
HyperCLOVA X Technical Report
authored a paper 19 days ago
Direct Preference-based Policy Optimization without Reward Modeling