ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation Paper • 2605.28293 • Published 14 days ago • 87
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 14 days ago • 90
BEV-LaneDet: a Simple and Effective 3D Lane Detection Baseline Paper • 2210.06006 • Published Oct 12, 2022 • 3
LoRA DreamBooth Training UI Space • Train and test custom LoRA DreamBooth models • 306