3 13 3

MikaStars39

https://mikastars39.notion.site

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

MiniMax Sparse Attention

upvoted a paper 3 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

liked a model 3 months ago

PolarSeeker/OpenSeeker-v1-30B-SFT

View all activity

Organizations

upvoted 2 papers 3 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 7 days ago • 137

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 7 days ago • 88

liked a model 3 months ago

PolarSeeker/OpenSeeker-v1-30B-SFT

Text Generation • 31B • Updated Mar 17 • 312 • 12

updated a dataset 4 months ago

MikaStars39/nano-eval

Preview • Updated Mar 3 • 99

published a dataset 4 months ago

MikaStars39/nano-eval

Preview • Updated Mar 3 • 99

upvoted a paper 6 months ago

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published Dec 29, 2025 • 28

submitted a paper to Daily Papers 6 months ago

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published Dec 29, 2025 • 28

updated a model 7 months ago

MikaStars39/PeRL

Updated Dec 2, 2025 • 3

published a model 7 months ago

MikaStars39/PeRL

Updated Dec 2, 2025 • 3

published a Space 7 months ago

Open Tinker

🐠

updated a model 7 months ago

FlashOmni/rwkv7-0.4b-train-encoder

Updated Nov 6, 2025

liked a Space 8 months ago

The Smol Training Playbook

📚

3.21k

The secrets to building world-class LLMs

published a model 8 months ago

FlashOmni/rwkv7-0.4b-train-encoder

Updated Nov 6, 2025

updated a Space 8 months ago

README

🦀

published a Space 8 months ago

README

🦀

upvoted 4 papers 8 months ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27, 2025 • 87

Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding

Paper • 2505.16990 • Published May 22, 2025 • 22

REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration

Paper • 2510.01879 • Published Oct 2, 2025 • 8

Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?

Paper • 2510.06036 • Published Oct 7, 2025 • 7

commented a paper 8 months ago

Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?

Paper • 2510.06036 • Published Oct 7, 2025 • 7 •

MikaStars39

AI & ML interests

Recent Activity

Organizations

MikaStars39's activity

Open Tinker

The Smol Training Playbook

README

README