arxiv:2409.00492
Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated a model 3 days ago
google/gemma-4-E4B-it-qat-mobile-ct updated a model 3 days ago
google/gemma-4-E2B-it-qat-mobile-ct published a model 5 days ago
google/gemma-4-E4B-it-qat-mobile-ct