Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents Paper • 2605.29447 • Published 14 days ago • 20
PlatonicNav: Unveiling Semantic Correspondence in Navigation with Platonic Topological Maps Paper • 2606.01788 • Published 10 days ago • 9
One-Forcing: Towards Stable One-Step Autoregressive Video Generation Paper • 2605.23458 • Published 20 days ago • 7
ijinyu1113/ft_mr7_410m_seed42_lr3e-5_wd0.05_oldcfg300ep_modarith_subtract_max500_evalevery100_purenum Updated 8 days ago • 1
The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement Paper • 2605.30888 • Published 13 days ago • 10
CoRL2026-CSI/IsaacLab-SO101-Phase1-pick_place-80episode-10fps Viewer • Updated 9 days ago • 25.3k • 46 • 1
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 15 days ago • 423
Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints Paper • 2605.21085 • Published 22 days ago • 5
EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation Paper • 2605.23271 • Published 20 days ago • 80
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published about 1 month ago • 195
ModelLens: Finding the Best for Your Task from Myriads of Models Paper • 2605.07075 • Published May 8 • 15
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published May 7 • 233