Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution Paper • 2606.06492 • Published 8 days ago • 83
Cosmos 3: Omnimodal World Models for Physical AI Paper • 2606.02800 • Published 11 days ago • 115
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 21 days ago • 46
Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction Paper • 2605.26230 • Published 18 days ago • 41
GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction Paper • 2605.23888 • Published 21 days ago • 14
Cartridges: Lightweight and general-purpose long context representations via self-study Paper • 2506.06266 • Published Jun 6, 2025 • 8
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published 25 days ago • 113
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published May 4 • 348
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation Paper • 2605.03849 • Published May 5 • 126
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published Apr 24 • 63
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation Paper • 2604.18168 • Published Apr 20 • 96
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published Apr 9 • 247
Running on Zero Agents Featured 952 FLUX.2 [dev] 💻 952 Generate or edit images from text and optional photos
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 154
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published Mar 3 • 145