Mabry PRO

artificial-citizen

·

https://www.joshmabry.dev

Mabry1985

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

protoLabsAI/Ornith-1.0-9B-MTP-GGUF:Iq quants

new activity 2 days ago

protoLabsAI/Ornith-1.0-9B-MTP-GGUF:Vision support?

posted an update 2 days ago

Built OpenRouter's Fusion on our own LiteLLM gateway, then benchmarked whether it earned its cost. The detail that decides the design: in OpenRouter's own numbers, fusing a model with itself still gained ~6.7 points. So the engine is the judge synthesizing over diverse samples, not the mix of models. Self-MoA ("Rethinking Mixture-of-Agents", arXiv 2502.00674) backs it — aggregating samples from one strong model beats mixing in weaker ones, which usually dilutes quality. That maps cleanly onto local inference. A multi-model panel means holding N models resident, a non-starter on one shared card. Judged self-consistency needs only one, and ours already runs as two load-balanced replicas, so the samples spread across both GPUs for free. ~360-line CustomLLM provider, every sub-call looped back through the gateway so it keeps routing, fallbacks, and cost tracking, and a 29-prompt blind-ranked benchmark with an explicit ship rule. All MIT. Breakdown: https://protolabs.studio/blog/fusion-on-your-own-litellm-gateway Code: https://github.com/protoLabsAI/fusion-gateway

View all activity

Organizations

artificial-citizen 's datasets 6

artificial-citizen/zac_sample-dataset-tokenised

Viewer • Updated Mar 22, 2025 • 20 • 6

artificial-citizen/ava_chatml_full

Viewer • Updated Feb 5, 2024 • 214k • 18

artificial-citizen/chuck_norris_jokes_chatml

Viewer • Updated Feb 4, 2024 • 169 • 5

artificial-citizen/Evol-Instruct-Code

Viewer • Updated Feb 4, 2024 • 78.3k • 18

artificial-citizen/ava_chatml

Viewer • Updated Feb 4, 2024 • 6.53k • 5

artificial-citizen/tiny-cot-alpaca

Viewer • Updated Sep 27, 2023 • 599k • 11 • 1