nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 Text Generation • 561B • Updated about 11 hours ago • 47.3k • 124
Joint Agent Memory and Exploration Learning via Novelty Signals Paper • 2606.01528 • Published 5 days ago • 14
Skill is Not One-Size-Fits-All: Model-Aware Skill Alignment for LLM Agents Paper • 2605.30723 • Published 8 days ago • 16
When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs Paper • 2605.24202 • Published 15 days ago • 17
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents Paper • 2605.30621 • Published 9 days ago • 19
AutoMedBench: Towards Medical AutoResearch with Agentic AI Models Paper • 2606.01961 • Published 5 days ago • 25
SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories Paper • 2606.01311 • Published 6 days ago • 32
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 13 days ago • 34