LMSYS Arena: Gemini 3 Pro, #1
SWE-bench: Opus 4.6, 80.8%
ARC-AGI 2: Opus 4.6, 68.8%
GPQA Diamond: GPT-5.2, 93.2%
Terminal-Bench: Opus 4.6, 65.4%
BrowseComp: Opus 4.6, 84.0%
MMMLU: Gemini 3 Pro, 91.8%
Humanity's Last Exam: Opus 4.6, 53.1%
OSWorld: Opus 4.6, 72.7%
GDPval-AA: Opus 4.6, 1606 Elo
Finance Agent: Opus 4.6, 60.7%
OpenClaw: 145k+ stars, viral
Thierry Damiba

// Building the agentic work stack.

Developer Evangelist at Arcade. Founder, Scale Intelligence. I write playbooks from the field on agents, GTM engineering, and what happens when machines start doing the work.

Latest

The Problem Is Not Delegation

Read the essay

Building something in the agent space? Book a call →