LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 82 • 17
Rethinking Cross-Layer Information Routing in Diffusion Transformers Paper • 2605.20708 • Published 29 days ago • 111 • 6
view article Article Introducing NVIDIA Cosmos Policy for Advanced Robot Control nvidia • Jan 29 • 48
view article Article NEO-unify: Building Native Multimodal Unified Models End to End sensenova • Mar 5 • 165