s-nlp/tool-calling-hallucination-modernbert-base-glaive-100pct Token Classification • 0.1B • Updated 11 days ago • 16
s-nlp/tool-calling-hallucination-modernbert-base-glaive-100pct Token Classification • 0.1B • Updated 11 days ago • 16
s-nlp/tool-calling-hallucination-modernbert-large-glaive-100pct Token Classification • 0.4B • Updated 11 days ago • 10
s-nlp/tool-calling-hallucination-modernbert-large-glaive-100pct Token Classification • 0.4B • Updated 11 days ago • 10
s-nlp/tool-calling-hallucination-modernbert-base-unified-final Token Classification • 0.1B • Updated 11 days ago • 31
s-nlp/tool-calling-hallucination-modernbert-base-unified-final Token Classification • 0.1B • Updated 11 days ago • 31
ssurface/tool-calling-hallucination-modernbert-base-glaive-100pct Token Classification • 0.1B • Updated 11 days ago • 44
ssurface/tool-calling-hallucination-modernbert-base-glaive-100pct Token Classification • 0.1B • Updated 11 days ago • 44
ssurface/tool-calling-hallucination-modernbert-large-glaive-100pct Token Classification • 0.4B • Updated 11 days ago • 51
ssurface/tool-calling-hallucination-modernbert-large-glaive-100pct Token Classification • 0.4B • Updated 11 days ago • 51
ssurface/tool-calling-hallucination-modernbert-base-unified-final Token Classification • 0.1B • Updated 11 days ago • 59
ssurface/tool-calling-hallucination-modernbert-base-unified-final Token Classification • 0.1B • Updated 11 days ago • 59
Qwen3-4B CoT Compression Study Collection LoRA adapters trained for 5 progressively shorter chain-of-thought styles on GSM8K, plus the eval artifacts behind the Pareto curve. • 6 items • Updated 19 days ago • 1