TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios • Paper 2602.01675 • Published Feb 2
Agent Laboratory: Using LLM Agents as Research Assistants • Paper 2501.04227 • Published Jan 8, 2025