amd/Llama-2-70b-chat-hf-WMXFP4-AMXFP4-KVFP8-Scale-UINT8-MLPerf-GPTQ 37B • Updated Aug 5, 2025 • 18
amd/Llama-3.1-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 13 • 2
amd/Llama-3-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 13 • 2
amd/Llama2-7b-chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 5
amd/Llama-2-7b-hf-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 5
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 10 • 1
amd/gemma-2-2b-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid_v2 Text Generation • Updated Jun 23, 2025
amd/gemma-2-2b-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid Text Generation • Updated Jun 23, 2025 • 15