Extreme Compression of Large Language Models via Additive Quantization Paper • 2401.06118 • Published Jan 11, 2024 • 14
ISTA-DASLab/Meta-Llama-3-70B-Instruct-AQLM-2Bit-1x16 Text Generation • 11B • Updated May 13, 2024 • 15 • 20