Woojung Song

Opusdei

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

ABot-Earth 0.5: Generative 3D Earth Model

upvoted a paper 15 days ago

World Pilot: Steering Vision-Language-Action Models with World-Action Priors

upvoted a paper 15 days ago

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Organizations

None yet

upvoted 4 papers 15 days ago

ABot-Earth 0.5: Generative 3D Earth Model

Paper • 2606.09967 • Published 18 days ago • 482

World Pilot: Steering Vision-Language-Action Models with World-Action Priors

Paper • 2606.12403 • Published 16 days ago • 26

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Paper • 2606.12191 • Published 16 days ago • 67

Agents' Last Exam

Paper • 2606.05405 • Published 23 days ago • 363

authored a paper 16 days ago

Human Psychometric Questionnaires Mischaracterize LLM Behavior

Paper • 2509.10078 • Published 28 days ago • 36

liked a model 17 days ago

Value4AI/ValueLlama-3-8B

Text Generation • 8B • Updated Sep 19, 2024 • 70 • 6

upvoted 2 papers 17 days ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 21 days ago • 119

Human Psychometric Questionnaires Mischaracterize LLM Behavior

Paper • 2509.10078 • Published 28 days ago • 36

submitted a paper to Daily Papers 17 days ago

Human Psychometric Questionnaires Mischaracterize LLM Behavior

Paper • 2509.10078 • Published 28 days ago • 36

upvoted a paper 17 days ago

SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations

Paper • 2606.05563 • Published 22 days ago • 53

upvoted a paper 19 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 25 days ago • 135

authored a paper 20 days ago

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

Paper • 2606.05553 • Published 22 days ago • 48

upvoted 2 papers 21 days ago

RobotValues: Evaluating Household Robots When Human Values Conflict

Paper • 2606.03312 • Published 24 days ago • 26

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

Paper • 2606.05553 • Published 22 days ago • 48

liked a dataset about 1 month ago

Value4AI/Agent-ValueBench

Viewer • Updated May 14 • 9.06k • 2.22k • 3

upvoted 4 papers about 1 month ago

Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement

Paper • 2605.14368 • Published May 14 • 16

Agent-ValueBench: A Comprehensive Benchmark for Evaluating Agent Values

Paper • 2605.10365 • Published May 11 • 9

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published May 8 • 22

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published May 8 • 18

updated a collection almost 2 years ago

forfun

Collection

1 item • Updated Aug 7, 2024
