Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated Apr 29 • 58
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 23 items • Updated 4 days ago • 322
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 506
Nemotron ColEmbed V2 Collection State-of-the-Art Late Interaction Vision-Language Embedding Models • 3 items • Updated 4 days ago • 14
Onnxruntime DirectML GenAI Collection Model Powered by Onnxruntime DirectML GenAI • 12 items • Updated Aug 6, 2024 • 1
Cosmos-Reason2 Collection ⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3 • 8 items • Updated 4 days ago • 26
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 14 items • Updated 4 days ago • 55
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated Mar 12 • 220
HomePhi4 - Home Assistant Reasoning LLM Collection A collection of quants and merges of HomePhi4, resulting from microsoft/Phi-4-mini-reasoning being finetuned against acon96/Home-Assistant-Requests. • 4 items • Updated Dec 6, 2025 • 1
Granite 4.0 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated Apr 29 • 221