QED-Nano: Teaching a Tiny Model to Prove Hard Theorems
π
74
Who needs 1T parameters? Olympiad proofs with a 4B model
Visualize onβpolicy distillation token alignment
Evaluate multilingual models using FineTasks
The secrets to building world-class LLMs