VGenST-Bench: A Benchmark for Spatio-Temporal Reasoning via Active Video Synthesis Paper • 2605.22570 • Published 21 days ago • 24
Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection Paper • 2603.21944 • Published Mar 23 • 26