DaveGabe/TinyStoriesV2_cleaned-inv-first-voc2048-seq256-overlap25 Viewer • Updated Sep 16, 2025 • 483k • 21
DaveGabe/TinyStoriesV2_cleaned-inv-first-voc2048-seq256-overlap25 Viewer • Updated Sep 16, 2025 • 483k • 21
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29, 2025 • 96
Running on Zero Agents Featured 925 Screenshot to HTML ⚡ 925 Generate HTML code from a website screenshot
Runtime error Agents Featured 1.01k Model Memory Utility 🚀 1.01k Calculate GPU memory needed for training Hugging Face models