Sergio Paniego PRO
AI & ML interests
Recent Activity
Organizations
Posts 96
Starting today, it's coordinated by a committee that includes Meta-PyTorch, Reflection, Unsloth, Modal, Prime Intellect, Nvidia, Mercor, Fleet AI, and Hugging Face
frontier labs train their models and their harnesses together. Claude knows Claude Code. GPT-5.5 knows Codex. that's not an accident, it's training. open-source models deserve the same magic, but pulling that off requires infrastructure that belongs to everyone, not one lab
OpenEnv is that layer. one api, any harness, any trainer, any environment
Rewards and training loops stay in TRL, Unsloth, wherever you already work. OpenEnv is the socket they all plug into
Get involved!
Full announcement: https://huggingface.co/blog/openenv-agentic-rl
Articles 20
The Open Source Community is backing OpenEnv for Agentic RL
- Runtime errorRL
CARLA Environment Server
🚗Control a Carla driving simulation with custom actions
- Runtime errorRL
CARLA Environment Server
🚗Control a CARLA driving simulator with custom actions
- SleepingAgents
Carla Grpo Trolley
🚀Visualize your program’s I/O activity in real time
-
sergiopaniego/Qwen3-0.6B-carla-trolley-escape
0.8B • Updated • 5
- Running3.88k
The Ultra-Scale Playbook
🌌3.88kThe ultimate guide to training LLM on large GPU Clusters
- Running on CPU UpgradeFeatured3.2k
The Smol Training Playbook
📚3.2kThe secrets to building world-class LLMs
- Running325
Evaluation Guidebook
📝325Explore LLM benchmark scores over time
- Running225
FineVision: Open Data is All You Need
📝225A new open-source dataset for training VLMs
- Runtime errorRL
CARLA Environment Server
🚗Control a Carla driving simulation with custom actions
- Runtime errorRL
CARLA Environment Server
🚗Control a CARLA driving simulator with custom actions
- SleepingAgents
Carla Grpo Trolley
🚀Visualize your program’s I/O activity in real time
-
sergiopaniego/Qwen3-0.6B-carla-trolley-escape
0.8B • Updated • 5
- Running3.88k
The Ultra-Scale Playbook
🌌3.88kThe ultimate guide to training LLM on large GPU Clusters
- Running on CPU UpgradeFeatured3.2k
The Smol Training Playbook
📚3.2kThe secrets to building world-class LLMs
- Running325
Evaluation Guidebook
📝325Explore LLM benchmark scores over time
- Running225
FineVision: Open Data is All You Need
📝225A new open-source dataset for training VLMs
spaces 139
VLM Object Understanding
Explore object detection, visual grounding, keypoint Detecti
Qwen2-VL-7B
Ask questions about charts in images
SmolVLM-trl-dpo-rlaif-v
Generate text from an image and question
SmolVLM-trl-sft-ChartQA
Ask questions about charts in images
Huggingface Static C3b61b
View and manage your tracking data in an interactive dashboard
Huggingface Static A08598
View project metrics on an interactive dashboard