Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus Paper • 2606.15345 • Published 8 days ago • 14
ShirohaNaruse/game-review-sentiment-distilbert Text Classification • 67M • Updated 24 days ago • 37 • 1
SAM 3D Animal: Promptable Animal 3D Reconstruction from Images in the Wild Paper • 2605.07604 • Published May 8 • 4
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation Paper • 2605.25874 • Published 27 days ago • 103
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published May 7 • 116
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 171
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 123