Perry the Platypus PRO

AgPerry

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

huggingface/HuggingDiscussions:[FEEDBACK] Daily Papers

commented on a paper 6 days ago

Function-Aware Fill-in-the-Middle as Mid-Training for Coding Agent Foundation Models

upvoted a paper 6 days ago

Function-Aware Fill-in-the-Middle as Mid-Training for Coding Agent Foundation Models

Organizations

New activity in huggingface/HuggingDiscussions 1 day ago

[FEEDBACK] Daily Papers

🔥❤️ 21

207

#32 opened about 2 years ago by

kramp

commented on a paper 6 days ago

Function-Aware Fill-in-the-Middle as Mid-Training for Coding Agent Foundation Models

Paper • 2607.12463 • Published 9 days ago • 107

upvoted a paper 6 days ago

Function-Aware Fill-in-the-Middle as Mid-Training for Coding Agent Foundation Models

Paper • 2607.12463 • Published 9 days ago • 107

upvoted a paper 7 days ago

Search Beyond What Can Be Taught: Evolving the Knowledge Boundary in Agentic Visual Generation

Paper • 2607.05382 • Published 14 days ago • 86

updated a dataset 12 days ago

AgPerry/rsi-bench

Viewer • Updated 12 days ago • 300 • 44

published a dataset 13 days ago

AgPerry/rsi-bench

Viewer • Updated 12 days ago • 300 • 44

upvoted 3 papers about 1 month ago

Dr-DCI: Scaling Direct Corpus Interaction via Dynamic Workspace Expansion

Paper • 2606.14885 • Published Jun 12 • 11

ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence

Paper • 2605.26340 • Published May 25 • 37

Where, What, Why, and Importance: Structured Defect Grounding for Text-to-Image Feedback

Paper • 2606.06113 • Published Jun 4 • 15

updated a dataset about 1 month ago

TIGER-Lab/ClawBench

Viewer • Updated Jun 10 • 283 • 511 • 1

upvoted a paper about 2 months ago

MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection

Paper • 2605.30288 • Published May 29 • 23

updated a Space about 2 months ago

ClawBench Leaderboard

🦀

Can AI agents complete everyday online tasks?

updated 4 datasets about 2 months ago

commented on a paper 2 months ago

RewardHarness: Self-Evolving Agentic Post-Training

Paper • 2605.08703 • Published May 9 • 10

upvoted a paper 2 months ago

RewardHarness: Self-Evolving Agentic Post-Training

Paper • 2605.08703 • Published May 9 • 10

submitted a paper to Daily Papers 2 months ago

RewardHarness: Self-Evolving Agentic Post-Training

Paper • 2605.08703 • Published May 9 • 10

updated a collection 2 months ago

ClawBench

Collection

Benchmark dataset (V1+V2), live leaderboard Space, and full V1 execution traces — everything you need to run, regrade, or compare on ClawBench. • 5 items • Updated May 12
