arxiv:2605.29447
Hao Jiang
Lutalica
AI & ML interests
Multimodal LLMs, LLM Reasoning, Reinforcement Learning, Efficient Inference
Recent Activity
authored a paper 1 day ago
Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents upvoted a paper 1 day ago
Pyramid Texture Filtering authored a paper 5 days ago
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use