Model checkpoints for paper "A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone".
Jitai Hao
JitaiHao
AI & ML interests
None yet
Recent Activity
updated a dataset 6 days ago
JitaiHao/deltakv_qwen_train_vr1.0_num40000_seqlen8192 published a dataset 6 days ago
JitaiHao/deltakv_qwen_train_vr1.0_num40000_seqlen8192 updated a dataset 6 days ago
JitaiHao/deltakv_qwen_train_v3.0_num40000_seqlen8192