Beyond APIs: Probing the Limits of MLLMs in Physical Tool Use
Paper • 2606.10803
We focus on Natural Language Processing and Multimodal Learning, exploring generative AI across different modalities.
Optical Reasoning: Rethinking Images as an Expressive Reasoning Medium Beyond Text
One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment