BadCat
Foresta
ยท
AI & ML interests
LLMs
Deep learning
Reinforcement learning
Recent Activity
upvoted a paper about 17 hours ago
On the Geometry of On-Policy Distillation upvoted a paper about 1 month ago
A^2TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping liked a Space 2 months ago
duoan/TorchCodeOrganizations
None yet