Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
liked a dataset about 11 hours ago
crosbylegal/RedlineBench liked a model about 12 hours ago
MiniMaxAI/MiniMax-M3 upvoted an article about 13 hours ago
GLM-5.2: Built for Long-Horizon TasksOrganizations
benchmarks
RULER Datasets Falcon-H1-3B-Base
RULER Datasets
RULER Datasets Lamma3-Instruct
RULER Datasets
RULER Datasets Qwen2.5-Instruct
RULER Datasets
RULER Datasets Qwen-3-Instruct
RULER Datasets
RULER Datasets Qwen-3
RULER Datasets
agents
Agents ressources
All the ressources I found / used when getting up to speed with agents.