15 47 2

Yang Shi

DogNeverSleep

https://FrankYang-17.github.io/

FrankYang-17

AI & ML interests

👨🏻‍🎓PhD student at Peking University

Recent Activity

published a dataset 10 days ago

KeyFrame-Review/Data-301-377

upvoted a paper 12 days ago

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

upvoted a paper 12 days ago

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

View all activity

Organizations

published a dataset 10 days ago

KeyFrame-Review/Data-301-377

Viewer • Updated 10 days ago • 2.45k • 37

upvoted 2 papers 12 days ago

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

Paper • 2606.05259 • Published 15 days ago • 39

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

Paper • 2606.06042 • Published 14 days ago • 24

updated a dataset 15 days ago

KeyFrame-Review/Review-Data

Viewer • Updated 15 days ago • 12.2k • 47

published a dataset 16 days ago

KeyFrame-Review/Review-Data

Viewer • Updated 15 days ago • 12.2k • 47

upvoted a paper 17 days ago

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

Paper • 2605.31336 • Published 20 days ago • 12

upvoted a paper 20 days ago

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Paper • 2605.30263 • Published 21 days ago • 58

authored a paper 22 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 24 days ago • 38

upvoted a paper 22 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 24 days ago • 38

submitted a paper to Daily Papers 22 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 24 days ago • 38

upvoted a paper 23 days ago

Channel-wise Vector Quantization

Paper • 2605.26089 • Published 24 days ago • 15

authored a paper 27 days ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 28 days ago • 46

upvoted a paper 27 days ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 28 days ago • 46

New activity in DogNeverSleep/Artifact-Bench 28 days ago

Add dataset card for Artifact-Bench

#2 opened 28 days ago by

nielsr

authored 2 papers 29 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published May 18 • 22

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

Paper • 2605.20183 • Published about 1 month ago • 14

upvoted 2 papers 29 days ago

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

Paper • 2605.20183 • Published about 1 month ago • 14

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published May 18 • 22

submitted a paper to Daily Papers 29 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published May 18 • 22

updated a dataset about 1 month ago

DogNeverSleep/Artifact-Bench

Viewer • Updated 28 days ago • 1.35k • 2.47k • 3

Yang Shi

AI & ML interests

Recent Activity

Organizations

DogNeverSleep's activity

Add dataset card for Artifact-Bench