Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1605.6
TFLOPS
76
8
340
ginipick
ginipick
Follow
Armandoumas's profile picture
huathedev's profile picture
leemonz's profile picture
706 followers
·
156 following
AI & ML interests
None yet
Recent Activity
updated
a Space
about 3 hours ago
ginigen/AI
reacted
to
SeaWolf-AI
's
post
with 🧠
7 days ago
Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀 We're excited to release Darwin-60B-DUO, the Darwin family's first DUO model. Take two domain-verified specialists, hide them behind a single OpenAI-compatible endpoint, and let a router decide which one (or both) answers. You see one model, one API — but get the best of both. The number that matters: on the full 198-question GPQA Diamond, Darwin-60B-DUO hits 88.38%. The constituents alone land at 69.70% (Darwin-28B-REASON) and 77.27% (AWAXIS-Think-31B); a naive cascade only reaches 83.84%. The DUO clears them all. Two small specialists, intelligently routed, beat one big generalist on cost and quality. Both are independently verified — Darwin-28B-REASON is #3 on the HF GPQA Diamond leaderboard, AWAXIS-Think-31B is #1 on Korea's national K-AI Leaderboard (MSIT). The brains is a Hybrid-A router picking one of five strategies on the fly. Korean → AWAXIS, English/STEM → Darwin (single-backend, ~70% of traffic at 1× cost). When a Korean answer needs rigorous English reasoning, split_refine fires — Darwin drafts, AWAXIS polishes; MCQ/short-answer runs both with self-consistency + cross-verify. Net effective cost: only ~1.3× a single 30B model. The part the community will care about: the gateway is model-agnostic and Apache-2.0. Point it at any two OpenAI-compatible backends and you've got a DUO in minutes — teach router.py when to use which, and parallel calls, response merging, and routing transparency via _duo_route are handled for you. Fork it and tell us what you built. Painless deploy: docker compose up for both vLLM backends + gateway; FP8 ~30GB colocates on a single B200/H100. One git clone (~120GB). Text-only for now, streaming in v1.1. Two SOTAs, one endpoint. Come build your own on the Community tab. 👇 🔗 https://huggingface.co/FINAL-Bench/Darwin-60B-DUO
liked
a model
7 days ago
FINAL-Bench/Darwin-60B-DUO
View all activity
Organizations
ginipick
's models
11
Sort: Recently updated
ginipick/Qwen-Image-Edit-Rapid-AIO
Text-to-Image
•
Updated
Nov 2, 2025
•
1
ginipick/GLM-4.6
Text Generation
•
357B
•
Updated
Nov 2, 2025
•
6
ginipick/neutts-air
Text-to-Speech
•
0.7B
•
Updated
Nov 2, 2025
•
35
•
1
ginipick/MiniMax-M2
Text Generation
•
229B
•
Updated
Nov 2, 2025
•
10
ginipick/PaddleOCR-VL
Image-Text-to-Text
•
1.0B
•
Updated
Nov 2, 2025
•
10
ginipick/DeepSeek-OCR
Image-Text-to-Text
•
3B
•
Updated
Nov 2, 2025
•
3
ginipick/Gemma-3-R1984-4B
Image-Text-to-Text
•
4B
•
Updated
Apr 22, 2025
•
9
•
•
52
ginipick/QwQ-32B-NF4
Text Generation
•
33B
•
Updated
Mar 21, 2025
•
9
•
30
ginipick/wan-lora-cat
Text-to-Video
•
Updated
Mar 16, 2025
•
2
ginipick/c-bag
Updated
Mar 13, 2025
ginipick/flux-lora-eric-cat
Text-to-Image
•
Updated
Dec 2, 2024
•
47
•
•
230