SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated Apr 10 • 17
VINCIE Collection A diffusion transformer model for in-context image generation and editing • 6 items • Updated Mar 19 • 13
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published Nov 19, 2025 • 234
view article Article State of open video generation models in Diffusers +1 sayakpaul, a-r-r-o-w, dn6 • Jan 27, 2025 • 71