Santosh Kompella
PRO
Sathya77
3 followers · 13 following
AI & ML interests
LLMs Natural Language Processing (NLP) Transformers Deep Learning Machine Learning
Recent Activity
posted an update 3 days ago
Built a ViT for ×4 image super-resolution from scratch in PyTorch, and I'm sharing the model. No pretrained weights.

Every component is implemented from scratch: strided Conv2d patch embedding, multi-head self-attention across 1,024 tokens, 6 pre-norm transformer blocks, and a PixelShuffle reconstruction head for learned upsampling. Trained on real images from the LSDIR dataset with fp16 AMP on a laptop GPU. Tiled inference handles arbitrary input sizes.

Current architecture: patch size 2, embed dim 64, 4 attention heads, 6 transformer blocks, ~786K parameters. Test PSNR: 23.30 dB.

The model handles broad structure well, but fine textures and sharp edges need more capacity. Working on a larger configuration next.

🤗 Space: https://huggingface.co/spaces/Sathya77/ViT-ISR-Tiny-LSDIR

Feedback welcome, especially on the architecture choices.
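For readers who want to see how the listed pieces could fit together, here is a minimal sketch of the described configuration (patch size 2, embed dim 64, 4 heads, 6 pre-norm blocks, PixelShuffle head). It is not the author's code. The class names `PreNormBlock` and `TinyViTSR`, the learned positional embedding, the MLP ratio of 4, the 3×3 convolution in the head, and the 64×64 input tile are all assumptions, and it swaps in PyTorch's built-in `nn.MultiheadAttention` rather than a hand-written attention layer. Because of those guesses, its parameter count will not necessarily match the ~786K quoted in the post.

```python
import torch
import torch.nn as nn


class PreNormBlock(nn.Module):
    """Pre-norm transformer block: LayerNorm is applied before each sub-layer."""

    def __init__(self, dim=64, heads=4, mlp_ratio=4):  # mlp_ratio is an assumption
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        # Built-in attention used for brevity; the post describes a from-scratch layer.
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, dim * mlp_ratio),
            nn.GELU(),
            nn.Linear(dim * mlp_ratio, dim),
        )

    def forward(self, x):
        h = self.norm1(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]  # residual around attention
        return x + self.mlp(self.norm2(x))                 # residual around MLP


class TinyViTSR(nn.Module):
    """Hypothetical x4 super-resolution ViT for fixed-size input tiles."""

    def __init__(self, patch=2, dim=64, heads=4, depth=6, scale=4, tile=64):
        super().__init__()
        self.patch = patch
        tokens = (tile // patch) ** 2  # 64x64 tile with patch 2 gives 1,024 tokens
        # Strided convolution: one embedding vector per non-overlapping patch.
        self.embed = nn.Conv2d(3, dim, kernel_size=patch, stride=patch)
        # Learned positional embedding (assumption; the post does not say which kind).
        self.pos = nn.Parameter(torch.zeros(1, tokens, dim))
        self.blocks = nn.ModuleList([PreNormBlock(dim, heads) for _ in range(depth)])
        self.norm = nn.LayerNorm(dim)
        r = patch * scale  # the token grid must be upsampled by patch * scale overall
        self.head = nn.Sequential(
            nn.Conv2d(dim, 3 * r * r, kernel_size=3, padding=1),
            nn.PixelShuffle(r),  # rearranges channels into an r-times larger image
        )

    def forward(self, x):
        b, _, h, w = x.shape
        gh, gw = h // self.patch, w // self.patch
        t = self.embed(x).flatten(2).transpose(1, 2)  # (B, gh*gw, dim)
        t = t + self.pos
        for blk in self.blocks:
            t = blk(t)
        t = self.norm(t).transpose(1, 2).reshape(b, -1, gh, gw)  # back to a feature map
        return self.head(t)


if __name__ == "__main__":
    model = TinyViTSR().eval()
    with torch.no_grad():
        out = model(torch.rand(1, 3, 64, 64))
    print(tuple(out.shape))
```

With a 64×64 tile and patch size 2, the token grid is 32×32, which is one way to arrive at the 1,024 tokens mentioned in the post. Larger images would then be processed tile by tile, in line with the tiled inference the post describes.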
updated a model 4 days ago: Sathya77/ViT-ISR-Tiny-LSDIR
published a model 4 days ago: Sathya77/ViT-ISR-Tiny-LSDIR
Organizations
None yet
Sathya77's datasets (1)
Sathya77/telecom_plans • Viewer • Updated Aug 30, 2025 • 125 • 8