This is the models of our paper "EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models".
xxr
xrxing
AI & ML interests
None yet
Recent Activity
upvoted a paper about 10 hours ago
MemTrain: Self-Supervised Context Memory Training upvoted a paper 1 day ago
Trust Region On-Policy Distillation submitted a paper 1 day ago
Trust Region On-Policy DistillationOrganizations
None yet