Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA Paper • 2603.08501 • Published Mar 9
Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation Paper • 2604.05083 • Published Apr 6
LLM-Based Multi-Task Bangla Hate Speech Detection: Type, Severity, and Target Paper • 2510.01995 • Published Oct 2, 2025
POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online Polarization Paper • 2505.20624 • Published Feb 5
Beyond MCQ: An Open-Ended Arabic Cultural QA Benchmark with Dialect Variants Paper • 2510.24328 • Published Apr 16
Once Correct, Still Wrong: Counterfactual Hallucination in Multilingual Vision-Language Models Paper • 2602.05437 • Published Apr 21
CritiSense: Critical Digital Literacy and Resilience Against Misinformation Paper • 2603.16672 • Published 22 days ago
Multilingual and Multimodal LLMs in the Wild: Building for Low-Resource Languages Paper • 2605.17152 • Published 27 days ago