DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects Paper • 2606.15133 • Published 8 days ago • 64
Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance Paper • 2606.19195 • Published 4 days ago • 103
PhoneHarness: Harnessing Phone-Use Agents through Mixed GUI, CLI, and Tool Actions Paper • 2606.14832 • Published 9 days ago • 12
Guava: An Effective and Universal Harness for Embodied Manipulation Paper • 2606.18363 • Published 5 days ago • 27
Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games Paper • 2606.19338 • Published 4 days ago • 43
MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction Paper • 2606.18558 • Published 4 days ago • 42
ENPIRE: Agentic Robot Policy Self-Improvement in the Real World Paper • 2606.19980 • Published 3 days ago • 7
S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence Paper • 2606.20515 • Published 3 days ago • 31
Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 5 days ago • 70
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 5 days ago • 168
Reinforcing Dual-Path Reasoning in Spatial Vision Language Models Paper • 2606.17539 • Published 5 days ago • 14
RF-DETR: Neural Architecture Search for Real-Time Detection Transformers Paper • 2511.09554 • Published Nov 12, 2025 • 12
Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models Paper • 2606.03748 • Published 19 days ago • 13
Fara-7B: An Efficient Agentic Model for Computer Use Paper • 2511.19663 • Published Nov 24, 2025 • 22
PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training Paper • 2606.03264 • Published 19 days ago • 20
Kronos: A Foundation Model for the Language of Financial Markets Paper • 2508.02739 • Published Aug 2, 2025 • 43