SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression Paper • 2509.25176 • Published Sep 29, 2025 • 14
Defending Against Malicious Finetuning by Scaling Train-time Adversarial Attacks Paper • 2606.07970 • Published 6 days ago