Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning Paper • 2606.10968 • Published 2 days ago • 41
ChemDFM-R: An Chemical Reasoner LLM Enhanced with Atomized Chemical Knowledge Paper • 2507.21990 • Published Jul 29, 2025 • 27