Takashi Ishida
tksii
AI & ML interests
None yet
Recent Activity
authored a paper about 21 hours ago
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness authored a paper about 21 hours ago
LLM Routing with Dueling Feedback authored a paper about 21 hours ago
Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests