Human Psychometric Questionnaires Mischaracterize LLM Behavior Paper • 2509.10078 • Published 11 days ago • 28
RobotValues: Evaluating Household Robots When Human Values Conflict Paper • 2606.03312 • Published 7 days ago • 25
RobotValues: Evaluating Household Robots When Human Values Conflict Paper • 2606.03312 • Published 7 days ago • 25
ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time? Paper • 2606.05553 • Published 5 days ago • 45
Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement Paper • 2605.14368 • Published 26 days ago • 16
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States Paper • 2605.07579 • Published May 8 • 18
KL for a KL: On-Policy Distillation with Control Variate Baseline Paper • 2605.07865 • Published May 8 • 22
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding Paper • 2510.00546 • Published Apr 20 • 14 • 3
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding Paper • 2510.00546 • Published Apr 20 • 14
view article Article How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons sherryxychen • Sep 30, 2025 • 71
π_0: A Vision-Language-Action Flow Model for General Robot Control Paper • 2410.24164 • Published Oct 31, 2024 • 31
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13, 2025 • 13.5k • • 2.07k