Vision LLM
Collecting best Vision LLMs - to study and learn from them
rhymes-ai/Aria • Image-Text-to-Text • 25B • Updated Apr 23, 2025 • 117k • 638
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 271 • 1.71k
jadechoghari/Ferret-UI-Gemma2b • Image-Text-to-Text • 3B • Updated Oct 18, 2024 • 161 • 52
jadechoghari/Ferret-UI-Llama8b • Image-Text-to-Text • 8B • Updated Jan 8, 2025 • 25 • 68

Image and 3D GenAI/Reconstruct
nvidia/nvpanoptix-3d-v1.1-front3d • Updated Mar 24 • 11 • 4
nvidia/NV-OneFormer • Updated 7 days ago • 9