DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning Paper • 2402.09136 • Published Feb 14, 2024 • 1
LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning Paper • 2409.12929 • Published Sep 19, 2024 • 2
Revisit Self-Debugging with Self-Generated Tests for Code Generation Paper • 2501.12793 • Published Jan 22, 2025
RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation Paper • 2502.09183 • Published Feb 13, 2025
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models Paper • 2503.06692 • Published Mar 9, 2025 • 2
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models Paper • 2505.15801 • Published May 21, 2025 • 17
MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task Paper • 2502.11684 • Published Feb 17, 2025 • 2
Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? Paper • 2505.16998 • Published May 22, 2025 • 2
FRABench and GenEval: Scaling Fine-Grained Aspect Evaluation across Tasks, Modalities Paper • 2505.12795 • Published May 19, 2025
SCoder: Iterative Self-Distillation for Bootstrapping Small-Scale Data Synthesizers to Empower Code LLMs Paper • 2509.07858 • Published Sep 9, 2025
Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning Paper • 2504.13500 • Published Apr 18, 2025