deadbydawn101/RavenX-CyberAgent-Qwen3.6-35B-A3B-Opus-4.7-OpenMythos-Pentester-BugHunter-RATH-GGUF Text Generation • 36B • Updated about 2 hours ago • 1.19k • 7
DavidAU/Qwen3.5-4B-Claude-4.6-OS-Auto-Variable-HERETIC-UNCENSORED-THINKING Image-Text-to-Text • 5B • Updated Mar 29 • 554 • 16
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4 Text Generation • 335B • Updated 2 days ago • 66.2k • 142
TurboQuant on Consumer GPUs — 100K Context on RTX 3090, 64K on RTX 4070 • Extend LLM context to 100K tokens on consumer GPUs • Running • 11