Jackrong/Qwopus3.6-27B-v2-MTP-GGUF Image-Text-to-Text • 0.5B • Updated 13 days ago • 168k • 359
Progressive Residual Warmup for Language Model Pretraining Paper • 2603.05369 • Published Mar 5 • 36