Perry the Platypus PRO

AgPerry

·

AI & ML interests

None yet

Recent Activity

new activity 6 days ago

huggingface/HuggingDiscussions:[FEEDBACK] Daily Papers

commentedon a paper 11 days ago

Function-Aware Fill-in-the-Middle as Mid-Training for Coding Agent Foundation Models

upvoted a paper 11 days ago

Function-Aware Fill-in-the-Middle as Mid-Training for Coding Agent Foundation Models

View all activity

Organizations

upvoted a paper 11 days ago

Function-Aware Fill-in-the-Middle as Mid-Training for Coding Agent Foundation Models

Paper • 2607.12463 • Published 13 days ago • 108

upvoted a paper 12 days ago

Search Beyond What Can Be Taught: Evolving the Knowledge Boundary in Agentic Visual Generation

Paper • 2607.05382 • Published 18 days ago • 87

upvoted 3 papers about 1 month ago

Dr-DCI: Scaling Direct Corpus Interaction via Dynamic Workspace Expansion

Paper • 2606.14885 • Published Jun 12 • 11

ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence

Paper • 2605.26340 • Published May 25 • 37

Where, What, Why, and Importance: Structured Defect Grounding for Text-to-Image Feedback

Paper • 2606.06113 • Published Jun 4 • 16

upvoted a paper about 2 months ago

MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection

Paper • 2605.30288 • Published May 29 • 23

upvoted a paper 2 months ago

RewardHarness: Self-Evolving Agentic Post-Training

Paper • 2605.08703 • Published May 9 • 10

upvoted 4 collections 3 months ago

eval-papers-collection

8 items • Updated Apr 13 • 1

Reading list

5 items • Updated May 10 • 1

Papers

4 items • Updated Apr 28 • 1

ClawBench — Browser Agent Benchmark Suite

Benchmark dataset (V1+V2), live leaderboard Space, and full V1 execution traces — everything you need to run, regrade, or compare on ClawBench. • 5 items • Updated May 12 • 1

upvoted 5 papers 3 months ago

Dr. Bench: A Multidimensional Evaluation for Deep Research Agents, from Answers to Reports

Paper • 2510.02190 • Published Jan 29 • 20

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 127

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published Apr 27 • 71

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Paper • 2503.10582 • Published Mar 13, 2025 • 25

upvoted 4 collections 3 months ago

Vision

47 items • Updated 3 days ago • 4

Saved

5 items • Updated Apr 10 • 1

Paper

136 items • Updated 4 days ago • 2

Video understanding

62 items • Updated 5 days ago • 5