free-bit

AI & ML interests

None yet

Organizations

None yet

upvoted 10 papers 1 day ago

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Paper • 2606.15133 • Published 8 days ago • 64

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

Paper • 2606.19195 • Published 4 days ago • 103

PhoneHarness: Harnessing Phone-Use Agents through Mixed GUI, CLI, and Tool Actions

Paper • 2606.14832 • Published 9 days ago • 12

Guava: An Effective and Universal Harness for Embodied Manipulation

Paper • 2606.18363 • Published 5 days ago • 27

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 4 days ago • 43

MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction

Paper • 2606.18558 • Published 4 days ago • 42

Playful Agentic Robot Learning

Paper • 2606.19419 • Published 4 days ago • 37

Thinking with Visual Grounding

Paper • 2606.16122 • Published 6 days ago • 7

ENPIRE: Agentic Robot Policy Self-Improvement in the Real World

Paper • 2606.19980 • Published 3 days ago • 7

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Paper • 2606.20515 • Published 3 days ago • 31

upvoted 3 papers 2 days ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 5 days ago • 70

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

Paper • 2606.18023 • Published 5 days ago • 168

Reinforcing Dual-Path Reasoning in Spatial Vision Language Models

Paper • 2606.17539 • Published 5 days ago • 14

upvoted 7 papers 3 days ago

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

Paper • 2511.09554 • Published Nov 12, 2025 • 12

Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models

Paper • 2606.03748 • Published 19 days ago • 13

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 174

Fara-7B: An Efficient Agentic Model for Computer Use

Paper • 2511.19663 • Published Nov 24, 2025 • 22

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

Paper • 2606.03264 • Published 19 days ago • 20

Kronos: A Foundation Model for the Language of Financial Markets

Paper • 2508.02739 • Published Aug 2, 2025 • 43

APPO: Agentic Procedural Policy Optimization

Paper • 2606.12384 • Published 10 days ago • 75