Kai Hua's picture

Hiring 💼

Kai Hua

kkish

·

https://kifish.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

AIR: Post-training Data Selection for Reasoning via Attention Head Influence

upvoted a paper 2 days ago

LLMs are Also Effective Embedding Models: An In-depth Overview

upvoted a paper 5 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

View all activity

Organizations

upvoted 2 papers 2 days ago

AIR: Post-training Data Selection for Reasoning via Attention Head Influence

Paper • 2512.13279 • Published Dec 15, 2025 • 2

LLMs are Also Effective Embedding Models: An In-depth Overview

Paper • 2412.12591 • Published Dec 17, 2024 • 2

upvoted 2 papers 5 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 8 days ago • 89

MiniMax Sparse Attention

Paper • 2606.13392 • Published 8 days ago • 139

updated a collection 5 days ago

Model Released

3 items • Updated 5 days ago

liked a model 5 days ago

MiniMaxAI/MiniMax-M3

Image-Text-to-Text • 427B • Updated 3 days ago • 56.2k • • 1.1k

reacted to FlameF0X's post with 🚀🔥 16 days ago

Post

7132

MiniMax-M3 coming soon.
https://github.com/MiniMax-AI/MiniMax-M3

upvoted a paper 29 days ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Paper • 2605.19577 • Published May 19 • 59

updated a collection 29 days ago

Model Released

3 items • Updated 5 days ago

liked a dataset 29 days ago

Kwai-Klear/GoLongRL

Viewer • Updated 24 days ago • 23k • 1.57k • 23

upvoted a collection 30 days ago

Qwen3-Reranker

3 items • Updated Dec 31, 2025 • 71

upvoted a paper about 1 month ago

OProver: A Unified Framework for Agentic Formal Theorem Proving

Paper • 2605.17283 • Published May 17 • 31

liked a dataset about 1 month ago

xiyuRenBill/MEMLENS

Viewer • Updated 5 days ago • 3.16k • 5.26k • 8

upvoted 2 papers about 1 month ago

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Paper • 2605.06527 • Published May 7 • 46

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 87

updated a collection about 2 months ago

Model Released

3 items • Updated 5 days ago

liked 2 models 2 months ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated Apr 24 • 4.35M • • 2.16k

MiniMaxAI/MiniMax-M2.7

Text Generation • 229B • Updated Apr 20 • 2.46M • • 1.22k

updated a collection 2 months ago

Seed Flagship Model Released

contributed • 8 items • Updated Apr 13 • 3