inference-optimization/dflash-DeepSeek-V4-Flash-all-swa-muon-speculators-online-500k 2B • Updated about 24 hours ago
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-muon-ckpt5 2B • Updated 1 day ago
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step420136 2B • Updated 4 days ago • 56
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-muon-ckpt4 2B • Updated 4 days ago • 17