arxiv:2603.06674
ZhenLin
chuan42
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning upvoted a paper about 1 month ago
Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders