Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models Paper • 2606.19297 • Published 19 days ago • 69
Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines? Paper • 2602.14111 • Published Feb 15 • 56
KAGE-Bench: Fast Known-Axis Visual Generalization Evaluation for Reinforcement Learning Paper • 2601.14232 • Published Jan 20 • 9
Don't Blind Your VLA: Aligning Visual Representations for OOD Generalization Paper • 2510.25616 • Published Oct 29, 2025 • 107
AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment Paper • 2506.04089 • Published Jun 4, 2025 • 47