Kong

csfufu

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

InterleaveThinker: Reinforcing Agentic Interleaved Generation

upvoted a paper 6 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

upvoted a paper 8 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Organizations

upvoted 2 papers 6 days ago

InterleaveThinker: Reinforcing Agentic Interleaved Generation

Paper • 2606.13679 • Published 7 days ago • 79

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 7 days ago • 89

upvoted 2 papers 8 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 9 days ago • 41

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 10 days ago • 33

upvoted 3 papers about 1 month ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

Paper • 2605.10832 • Published May 11 • 22

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published May 8 • 101

4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding

Paper • 2605.05997 • Published May 7 • 18

updated a dataset about 1 month ago

OpenSearch-VL/Search-VL-SFT-36K

Preview • Updated May 9 • 657 • 9

liked 3 models about 1 month ago

updated 3 models about 1 month ago

OpenSearch-VL/OpenSearch-VL-32B

1.14M • Updated May 7 • 4 • 2

OpenSearch-VL/OpenSearch-VL-30B-A3B

665k • Updated May 7 • 2

OpenSearch-VL/OpenSearch-VL-8B

770k • Updated May 7 • 19 • 5

updated a dataset about 1 month ago

OpenSearch-VL/Search-VL-RL-8K

Updated May 7 • 184 • 5

liked 2 datasets about 1 month ago

OpenSearch-VL/Search-VL-RL-8K

Updated May 7 • 184 • 5

OpenSearch-VL/Search-VL-SFT-36K

Preview • Updated May 9 • 657 • 9

upvoted a paper about 1 month ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published May 6 • 103

submitted a paper to Daily Papers about 1 month ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published May 6 • 103

published a dataset about 1 month ago

OpenSearch-VL/Search-VL-SFT-36K

Preview • Updated May 9 • 657 • 9
