meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22, 2025 • 713k • 1.31k
Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 8.66M • 1.59k
3loi/SER-Odyssey-Baseline-WavLM-Categorical Audio Classification • 0.3B • Updated Jun 12, 2024 • 317 • 11
3loi/SER-Odyssey-Baseline-WavLM-Multi-Attributes Audio Classification • 0.3B • Updated Jun 12, 2024 • 1.16k • 13
3loi/SER-Odyssey-Baseline-WavLM-Arousal Audio Classification • 0.3B • Updated Jun 12, 2024 • 148 • 2
3loi/SER-Odyssey-Baseline-WavLM-Valence Audio Classification • 0.3B • Updated Jun 12, 2024 • 155 • 1
3loi/SER-Odyssey-Baseline-WavLM-Dominance Audio Classification • 0.3B • Updated Jun 12, 2024 • 4 • 1
google/vit-base-patch16-224-in21k Image Feature Extraction • 86.4M • Updated Feb 5, 2024 • 1.93M • 411