What's the best agent for my domain?

Best Agent + Model Combination

Average success rate (%) by task complexity. Higher curves indicate better performance.

Best Agents

Best Models