3 40 2

Zhe Cao

MichaelCaoo

MichaelCao0

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

upvoted a paper 3 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

upvoted a paper 3 days ago

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

View all activity

Organizations

upvoted a paper 1 day ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 2 days ago • 42

upvoted 2 papers 3 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 9 days ago • 189

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

Paper • 2606.13432 • Published 8 days ago • 101

upvoted a paper 9 days ago

ABot-Earth 0.5: Generative 3D Earth Model

Paper • 2606.09967 • Published 11 days ago • 474

upvoted a paper 10 days ago

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Paper • 2606.08415 • Published 12 days ago • 48

upvoted a paper 11 days ago

UniSHARP: Universal Sharp Monocular View Synthesis

Paper • 2606.07514 • Published 14 days ago • 14

upvoted a paper 15 days ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 18 days ago • 54

upvoted a paper 21 days ago

SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing

Paper • 2604.19587 • Published Apr 21 • 48

upvoted a paper 23 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 25 days ago • 38

upvoted 4 papers about 1 month ago

DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo

Paper • 2605.16257 • Published May 15 • 54

MemPrivacy: Privacy-Preserving Personalized Memory Management for Edge-Cloud Agents

Paper • 2605.09530 • Published May 10 • 148

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published May 8 • 41

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published May 7 • 235

upvoted 3 papers about 2 months ago

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Paper • 2604.18564 • Published Apr 20 • 46

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published Apr 20 • 95

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 86

upvoted a paper 2 months ago

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Paper • 2604.14683 • Published Apr 16 • 36

authored a paper 2 months ago

Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization

Paper • 2604.12290 • Published Apr 14 • 16

upvoted 2 papers 2 months ago

CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published Apr 13 • 38