OpenGVLab/internimage_h_22kto1k_640
Image Classification • 1B • Updated • 38 • 3
Computer Vision
Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction
RIVER: A Real-Time Interaction Benchmark for Video LLMs