Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
2
3
Yurun Yuan
RyanYr
Follow
KenCao2007's profile picture
John6666's profile picture
xuanfeiren's profile picture
6 followers
·
2 following
yurun-yuan
AI & ML interests
None yet
Recent Activity
updated
a dataset
30 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_matheval
updated
a model
30 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
published
a model
30 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
View all activity
Organizations
None yet
RyanYr
's models
30
Sort: Recently updated
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
Updated
30 days ago
•
13
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_200
Updated
30 days ago
•
7
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
30 days ago
•
6
RyanYr/pg_sais-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl
Updated
30 days ago
•
8
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
30 days ago
•
7
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
30 days ago
•
7
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
30 days ago
•
7
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
30 days ago
•
6
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref
Updated
30 days ago
•
8
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
30 days ago
•
5
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref
Updated
30 days ago
•
7
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
about 1 month ago
•
6
RyanYr/pg_sais-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl
Updated
about 1 month ago
•
6
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
about 1 month ago
•
3
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
about 1 month ago
•
2
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
May 4
•
5
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl
Updated
May 4
•
5
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
May 4
•
5
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
May 4
•
4
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
May 4
•
5
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl
Updated
May 4
•
5
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
May 4
•
3
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B
Updated
May 4
•
6
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B
Updated
May 4
•
6
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
May 4
•
4
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl
Updated
May 3
•
4
RyanYr/grpo-dapo-qwen2.5-math-1.5B-n4
Updated
May 3
RyanYr/grpo-dapo-qwen3-1.7B-Base-mbs128-n4
Updated
Apr 20
RyanYr/grpo-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25
•
4
RyanYr/grpo-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25