Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
mm-evaluation
community
https://github.com/mm-evaluation
Activity Feed
Request to join this org
Follow
7
AI & ML interests
None defined yet.
Recent Activity
ZTWHHH
updated
a dataset
about 18 hours ago
mm-eval/TheoremQA
ZTWHHH
published
a dataset
about 18 hours ago
mm-eval/TheoremQA
ZTWHHH
updated
a dataset
about 18 hours ago
mm-eval/NaturalBench
View all activity
Team members
4
models
18
Sort: Recently updated
mm-eval/WeMM-Chat-2k-CN
Updated
Oct 14, 2025
•
2
mm-eval/WeMM-Chat-CN
Updated
Oct 14, 2025
•
1
mm-eval/WeMM
Updated
Oct 14, 2025
•
2
mm-eval/VLMEvalKit
Updated
Jul 20, 2025
mm-eval/llava-next-qwen-32b
Updated
May 7, 2025
•
2
mm-eval/minigpt4_13b
Updated
May 4, 2025
mm-eval/minigpt4_7b
Updated
May 4, 2025
mm-eval/minigpt4_v2
Updated
May 4, 2025
mm-eval/Llama-3-LongVILA-8B-512Frames
Text Generation
•
Updated
Apr 29, 2025
•
1
mm-eval/Llama-3-LongVILA-8B-1024Frames
Updated
Apr 29, 2025
•
1
View 18 models
datasets
115
Sort: Recently updated
mm-eval/TheoremQA
Viewer
•
Updated
about 18 hours ago
•
53
mm-eval/NaturalBench
Updated
about 18 hours ago
•
89
mm-eval/OCRBench-v2
Viewer
•
Updated
about 18 hours ago
•
10k
•
69
mm-eval/WorldMedQA-V
Viewer
•
Updated
about 19 hours ago
•
1.14k
mm-eval/VLMsAreBlind
Viewer
•
Updated
about 19 hours ago
•
8.02k
mm-eval/AlgoPuzzleVQA
Viewer
•
Updated
about 19 hours ago
•
1.8k
mm-eval/PuzzleVQA
Viewer
•
Updated
about 19 hours ago
•
2k
mm-eval/MedXpertQA
Viewer
•
Updated
about 19 hours ago
•
2.01k
mm-eval/SciVQA
Viewer
•
Updated
about 19 hours ago
•
4.2k
mm-eval/GMAI-MMBench
Viewer
•
Updated
about 19 hours ago
•
4.55k
View 115 datasets