view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 54
view article Article Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains JetBrains • 8 days ago • 29