Yinxu Pan

cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

upvoted a paper about 14 hours ago
Self-Distilled Policy Gradient
liked a dataset 7 days ago
openbmb/Ultra-FineWeb-L3
View all activity

Organizations

Diffusers Pipelines Library for Stable Diffusion's profile picture OpenBMB's profile picture XAgentCommunity's profile picture