Using RL to elicit context leverage ability of LLMs to learn unseen languages!
Hanxu Hu PRO
HanxuHU
AI & ML interests
LLM, NLP
Recent Activity
authored a paper about 12 hours ago
DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning authored a paper about 12 hours ago
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation upvoted a paper about 23 hours ago
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation