huawei-csl/Qwen3-8B-PreSINQ-GGUF
Text Generation • 8B • Updated • 10 • 2
None defined yet.
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights