mm-evaluation

community

https://github.com/mm-evaluation

AI & ML interests

None defined yet.

Recent Activity

ZTWHHH updated a dataset 2 days ago

mm-eval/ZoomBench

ZTWHHH updated a dataset 2 days ago

mm-eval/ZeroBench

models 18

mm-eval/WeMM-Chat-2k-CN

9B • Updated Oct 14, 2025 • 2

mm-eval/WeMM-Chat-CN

9B • Updated Oct 14, 2025 • 1

mm-eval/WeMM

10B • Updated Oct 14, 2025 • 1

mm-eval/VLMEvalKit

Updated Jul 20, 2025

mm-eval/llava-next-qwen-32b

33B • Updated May 7, 2025 • 1

mm-eval/minigpt4_13b

Updated May 4, 2025

mm-eval/minigpt4_7b

Updated May 4, 2025

mm-eval/minigpt4_v2

Updated May 4, 2025

mm-eval/Llama-3-LongVILA-8B-512Frames

Text Generation • Updated Apr 29, 2025 • 3

mm-eval/Llama-3-LongVILA-8B-1024Frames

Updated Apr 29, 2025 • 2

datasets 129

mm-eval/ST-VQA

Viewer • Updated 2 days ago • 4.07k • 53

mm-eval/OCRBench-v2

Viewer • Updated 2 days ago • 10k • 83

mm-eval/MEGA-Bench

Viewer • Updated 2 days ago • 7.18k • 74

mm-eval/DocVQA

Viewer • Updated 2 days ago • 10.5k • 76

mm-eval/xGQA

Viewer • Updated 2 days ago • 77.3k • 161

mm-eval/ZoomBench

Viewer • Updated 2 days ago • 845 • 64

mm-eval/ZeroBench

Viewer • Updated 2 days ago • 434 • 68 • 1

mm-eval/WorldVQA

Viewer • Updated 2 days ago • 3k • 114

mm-eval/WorldMedQA-V

Viewer • Updated 2 days ago • 1.14k • 60

mm-eval/Winoground

Viewer • Updated 2 days ago • 1.6k • 68
