Running Agents Featured 35 QwenScope 🔥 35 Explore and steer Qwen3 model features with interactive heatmaps
Running on Zero Agents Featured 44 RF-DETR Realtime Webcam Demo 🎯 44 Segment objects in live webcam and uploaded media
TIPSv2 Collection TIPSv2 foundational vision-language models. Webpage: https://gdm-tipsv2.github.io/ • 9 items • Updated Apr 14 • 33
RDP LoRA: Geometry-Driven Identification for Parameter-Efficient Adaptation in Large Language Models Paper • 2604.19321 • Published Apr 21 • 8
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 147k • • 2.87k
SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated Apr 10 • 17
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published Jan 8 • 59
Running on Zero Agents Featured 945 FLUX.2 [dev] 💻 945 Generate or edit images from text and optional photos
view article Article Fine-Tuning MetaCLIP-2 for Image Classification on Downstream Tasks prithivMLmods • Nov 15, 2025 • 7