OpenGVLab

community

https://github.com/opengvlab

Activity Feed Request to join this org

AI & ML interests

Computer Vision

Recent Activity

Eurayka authored a paper 2 days ago

TimeLens2: Generalist Video Temporal Grounding with Multimodal LLMs

wzk1015 submitted a paper 2 days ago

WorldCupArena: Fine-Grained Evaluation of Language Models and Deep-Research Agents on Football Forecasting

shepnerd updated a collection 29 days ago

View all activity

Papers

Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction

RIVER: A Real-Time Interaction Benchmark for Video LLMs

View all Papers

OpenGVLab 's models 286

OpenGVLab/InternVL3_5-241B-A28B-MPO

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 33 • 2

OpenGVLab/InternVL3_5-241B-A28B-Pretrained

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 30 • 1

OpenGVLab/InternVL3_5-241B-A28B-Instruct

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 5.39k • 16

OpenGVLab/InternVL3_5-38B-MPO

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 29 • 2

OpenGVLab/InternVL3_5-38B-Pretrained

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 28 • 2

OpenGVLab/InternVL3_5-38B

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 35.1k • 45

OpenGVLab/InternVL3_5-30B-A3B-MPO

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 32 • 4

OpenGVLab/InternVL3_5-30B-A3B

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 118k • 43

OpenGVLab/InternVL3_5-30B-A3B-Pretrained

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 30 • 1

OpenGVLab/InternVL3_5-38B-Instruct

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 14.6k • 6

OpenGVLab/InternVL3_5-14B-MPO

Image-Text-to-Text • 15B • Updated Aug 29, 2025 • 34 • 3

OpenGVLab/InternVL3_5-14B

Image-Text-to-Text • 15B • Updated Aug 29, 2025 • 25.7k • 30

OpenGVLab/InternVideo2_5_Chat_8B

Video-Text-to-Text • 8B • Updated Aug 4, 2025 • 2.98k • 91

OpenGVLab/ScaleCUA_Env

Updated Jul 31, 2025 • 2

OpenGVLab/InternVideo2-Stage2_6B-224p-f4

Updated Jul 30, 2025 • 6

OpenGVLab/Mono-InternVL-2B-S1-3

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 21 • 1

OpenGVLab/Mono-InternVL-2B-S1-2

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 20 • 1

OpenGVLab/Mono-InternVL-2B-S1-1

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 11

OpenGVLab/Docopilot-8B

Image-Text-to-Text • 8B • Updated Jul 20, 2025 • 17 • 3

OpenGVLab/Docopilot-2B

Image-Text-to-Text • 2B • Updated Jul 20, 2025 • 13 • 8

OpenGVLab/ZeroGUI-OSWorld-7B

Image-Text-to-Text • 8B • Updated Jun 20, 2025 • 12 • 7

OpenGVLab/InternVideo1.0

Video Classification • Updated Jun 10, 2025 • 1

OpenGVLab/ZeroGUI-AndroidLab-7B

Image-Text-to-Text • 8B • Updated May 30, 2025 • 14 • 5

OpenGVLab/InternVL3-9B-Instruct

Image-Text-to-Text • 9B • Updated May 29, 2025 • 125 • 4

OpenGVLab/InternVL3-9B

Image-Text-to-Text • 9B • Updated May 29, 2025 • 1.8k • 25

OpenGVLab/VisualPRM-8B-v1_1

Image-Text-to-Text • 8B • Updated May 29, 2025 • 85 • 9

OpenGVLab/InternVideo2_CLIP_S

0.4B • Updated May 22, 2025 • 344 • 3

OpenGVLab/VideoChat-Flash-Qwen2_5-7B-1M_res224

Video-Text-to-Text • 8B • Updated May 16, 2025 • 150 • 2

OpenGVLab/InternVL_2_5_HiCo_R64

Video-Text-to-Text • 8B • Updated May 13, 2025 • 93 • 4

OpenGVLab/VisualPRM-8B

Image-Text-to-Text • 8B • Updated May 6, 2025 • 82 • 18