- Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
  Paper • 2503.22675 • Published • 36
- Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
  Paper • 2503.22230 • Published • 45
- ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
  Paper • 2509.13313 • Published • 80
- WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
  Paper • 2509.13309 • Published • 68
bypan
bypan123
AI & ML interests
None yet
Recent Activity
upvoted a paper about 4 hours ago
LIMMT: Less is More for Motion Tracking
liked a model 4 months ago
UBTECH-Robotics/Thinker-4B
updated a model 4 months ago
UBTECH-Robotics/Thinker-4B