inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt2 0.6B • Updated about 7 hours ago
inference-optimization/Qwen3-8B-from-Qwen3-8B_regen-speculators.eagle3-qwen3arch-ckpt1 1B • Updated 1 day ago
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt1-20260609-0052 0.6B • Updated 2 days ago • 5
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt1 0.6B • Updated 3 days ago • 136
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt0.5 0.6B • Updated 3 days ago • 11
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt0 0.6B • Updated 6 days ago • 106
inference-optimization/Llama-4-Scout-1.7B-0.4B-Instruct Image-Text-to-Text • 2B • Updated 8 days ago • 23
inference-optimization/ctest-Qwen3.5-9B-sliding-window-all-speculator.dflash 2B • Updated 8 days ago • 39
inference-optimization/ctest-Qwen3.5-9B-sliding-window-speculator.dflash 2B • Updated 8 days ago • 56
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 22 days ago • 197