Hanxu Hu PRO

HanxuHU

·

https://hanxuhu.github.io/

AI & ML interests

LLM, NLP

Recent Activity

liked a model 14 days ago

meituan-longcat/LongCat-2.0

authored a paper about 2 months ago

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

authored a paper about 2 months ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

View all activity

Organizations

Collections 3

View 3 collections

Papers 9

arxiv:2606.06428

arxiv:2603.11193

arxiv:2602.17684

arxiv:2510.17715

models 14

HanxuHU/Qwen2-0.5B-SFT

Updated Mar 31, 2025

HanxuHU/self-seq-Meta-Llama-3-8B-tulu100k_seq_it2_llama70b

Updated Jun 22, 2024 • 2 • 1

HanxuHU/self-seq-Meta-Llama-3-8B-tulu100k_base_ours_new_llama70b

Text Generation • Updated Jun 21, 2024 • 3 • 1

HanxuHU/sit_all_models

Updated Jun 20, 2024

HanxuHU/flancot_full_it1

Updated May 30, 2024

HanxuHU/sharegpt_filter

Updated May 29, 2024

HanxuHU/files

Updated May 13, 2024

HanxuHU/my-mLLMs

Updated May 10, 2024

HanxuHU/multilingual_mmmu

Updated Apr 3, 2024

HanxuHU/alpaca_topk_indices

Updated Mar 13, 2024

datasets 66

HanxuHU/rl-new-language

Viewer • Updated Jun 5 • 135k • 4.15k

HanxuHU/ocr_data_question_28k_Qwen3-8B

Viewer • Updated Oct 28, 2025 • 28k • 26

HanxuHU/usaco_v2

Viewer • Updated Oct 11, 2025 • 294 • 14

HanxuHU/math_copy1

Viewer • Updated Sep 30, 2025 • 12.5k • 47

HanxuHU/math

Viewer • Updated Sep 30, 2025 • 12.5k • 41

HanxuHU/mt_data

Viewer • Updated Dec 31, 2024 • 796k • 202

HanxuHU/gemma-llama-2-9b-it-ultrafeedback-annotate-ultrafb-judge-5-maj

Viewer • Updated Nov 28, 2024 • 60k • 33

HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-merge-single-filtered

Viewer • Updated Nov 26, 2024 • 56.4k • 31

HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-judge-5-majority-filtered

Viewer • Updated Nov 26, 2024 • 55.2k • 17

HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-merge-single-judge

Viewer • Updated Nov 25, 2024 • 60.7k • 12

View 66 datasets