ClawBench: LLM Agent Benchmark

CLAW SCORE

Percentage of all evaluations resolved in ClawBench; higher is better

| Rank | Vendor | Model | CLAW SCORE |
|---|---|---|---|
| 01 | Z.ai | GLM-5-Turbo | 93.9 |
| 02 | ByteDance | Doubao-Seed-2.0-lite | 93.1 |
| 03 | OpenAI | GPT-5.4 | 92.2 |
| 04 | MiniMax | MiniMax-M2.5 | 92.1 |
| 05 | MiniMax | MiniMax-M2.7 | 91.7 |
| 06 | Z.ai | GLM-5 | 91.7 |
| 07 | Anthropic | Claude Opus 4.5 | 91.5 |
| 08 | Alibaba | Qwen3.5-35B-A3B | 91.4 |
| 09 | Xiaomi | MiMo-V2-Omni | 91.2 |
| 10 | Alibaba | Qwen3.5-397B-A17B | 90.0 |

SPEED

Time (s) to run all evaluations in ClawBench; lower is better

| Rank | Vendor | Model | Time |
|---|---|---|---|
| 01 | xAI | Grok 4.20 Beta | 524s |
| 02 | OpenAI | gpt-oss-20b | 530s |
| 03 | OpenAI | GPT-5.4 Mini | 589s |
| 04 | OpenAI | GPT-5.4 Nano | 649s |
| 05 | Google | Gemini 3 Flash Preview | 666s |
| 06 | Xiaomi | MiMo-V2-Omni | 848s |
| 07 | OpenAI | gpt-oss-120b | 1218s |
| 08 | OpenAI | GPT-5.4 | 1292s |
| 09 | Nvidia | Nemotron 3 Nano | 1298s |
| 10 | Z.ai | GLM-5-Turbo | 1317s |

COST

Cost (USD) to run all evaluations in ClawBench; lower is better

| Rank | Vendor | Model | Cost |
|---|---|---|---|
| 01 | OpenAI | gpt-oss-20b | $0.08 |
| 02 | OpenAI | GPT-5.4 Nano | $0.17 |
| 03 | OpenAI | gpt-oss-120b | $0.18 |
| 04 | StepFun | Step 3.5 Flash | $0.28 |
| 05 | DeepSeek | DeepSeek-V3.2(Non-thinking) | $0.32 |
| 06 | ByteDance | Doubao-Seed-2.0-lite | $0.33 |
| 07 | xAI | Grok 4.1 Fast | $0.33 |
| 08 | MiniMax | MiniMax-M2.5 | $0.38 |
| 09 | MiniMax | MiniMax-M2.7 | $0.44 |
| 10 | Anthropic | Claude Sonnet 4.5 | $0.49 |

Updated 03/23/2026

| Model | Type | Vendor | CLAW SCORE | Speed | Cost | Value |
|---|---|---|---|---|---|---|
| GLM-5-Turbo | Proprietary | Z.ai | 93.9 | 1317s | $0.83 | 113.1 |
| Doubao-Seed-2.0-lite | Proprietary | ByteDance | 93.1 | 1793s | $0.33 | 282.1 |
| GPT-5.4 | Proprietary | OpenAI | 92.2 | 1292s | $2.11 | 43.7 |
| MiniMax-M2.5 | Proprietary | MiniMax | 92.1 | 1908s | $0.38 | 242.3 |
| MiniMax-M2.7 | Proprietary | MiniMax | 91.7 | 2003s | $0.44 | 208.5 |
| GLM-5 | Open Weights | Z.ai | 91.7 | 2377s | $1.30 | 70.5 |
| Claude Opus 4.5 | Proprietary | Anthropic | 91.5 | 1556s | $9.85 | 9.3 |
| Qwen3.5-35B-A3B | Open Weights | Alibaba | 91.4 | 1615s | $0.56 | 163.3 |
| MiMo-V2-Omni | Proprietary | Xiaomi | 91.2 | 848s | $0.75 | 121.6 |
| Qwen3.5-397B-A17B | Open Weights | Alibaba | 90.0 | 1661s | $0.85 | 105.8 |
| GPT-5.4 Nano | Proprietary | OpenAI | 89.7 | 649s | $0.17 | 527.4 |
| Claude Haiku 4.5 | Proprietary | Anthropic | 89.4 | 1860s | $2.16 | 41.4 |
| MiMo-V2-Pro | Proprietary | Xiaomi | 89.3 | 1713s | $5.31 | 16.8 |
| Doubao-Seed-2.0-pro | Proprietary | ByteDance | 88.6 | 2293s | $1.00 | 88.6 |
| Grok 4.1 Fast | Proprietary | xAI | 88.6 | 1441s | $0.33 | 268.4 |
| Qwen3.5-Plus-2026-02-15 | Open Weights | Alibaba | 88.4 | 2794s | $1.17 | 75.6 |
| Claude Opus 4.6 | Proprietary | Anthropic | 88.2 | 1524s | $6.49 | 13.6 |
| Claude Sonnet 4.5 | Proprietary | Anthropic | 88.1 | 1676s | $0.49 | 179.8 |
| Gemini 3.1 Pro Preview | Proprietary | Google | 87.7 | 1891s | $2.12 | 41.4 |
| Qwen3.5-122B-A10B | Open Weights | Alibaba | 86.0 | 1431s | $1.00 | 86.0 |

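
The page does not define the Value column. The listed figures are consistent with CLAW SCORE divided by cost in USD (for example, 93.9 / 0.83 ≈ 113.1), with small differences that would be expected if the displayed cost is rounded. Below is a minimal sketch of that assumed relationship, using a few rows copied from the leaderboard. The function name `score_per_dollar` and the 0.5% tolerance are illustrative choices, not part of ClawBench.

```python
# Assumption: Value ~= CLAW SCORE / cost (USD). The page does not state this;
# it is inferred from the listed numbers.

# (model, claw_score, cost_usd, listed_value), copied from the leaderboard
ROWS = [
    ("GLM-5-Turbo", 93.9, 0.83, 113.1),
    ("Doubao-Seed-2.0-lite", 93.1, 0.33, 282.1),
    ("GPT-5.4", 92.2, 2.11, 43.7),
    ("Claude Opus 4.5", 91.5, 9.85, 9.3),
    ("GPT-5.4 Nano", 89.7, 0.17, 527.4),
]


def score_per_dollar(claw_score: float, cost_usd: float) -> float:
    """Hypothetical helper: benchmark score obtained per US dollar spent."""
    if cost_usd <= 0:
        raise ValueError("cost_usd must be positive")
    return claw_score / cost_usd


def relative_gap(computed: float, listed: float) -> float:
    """Relative difference between the computed and the listed value."""
    return abs(computed - listed) / listed


if __name__ == "__main__":
    for model, score, cost, listed in ROWS:
        computed = score_per_dollar(score, cost)
        gap = relative_gap(computed, listed)
        # Rounded inputs mean the match is approximate, not exact.
        status = "ok" if gap < 0.005 else "mismatch"
        print(f"{model:<22} computed={computed:7.1f} listed={listed:7.1f} {status}")
```

Under this reading, a cheap model with a slightly lower score (GPT-5.4 Nano) ranks far above an expensive one with a higher score (Claude Opus 4.5) on Value, which matches the ordering the leaderboard shows.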
https://clawbenchlabs.com

Multidimensional Analysis

Multidimensional Analysis: Office Collaboration

| Vendor | Model | Score |
|---|---|---|
| Z.ai | GLM-5-Turbo | 98 |
| Anthropic | Claude Opus 4.6 | 97 |
| OpenAI | GPT-5.4 | 96 |
| MiniMax | MiniMax-M2.7 | 94 |
| Alibaba | Qwen3.5-35B-A3B | 94 |
| Alibaba | Qwen3.5-397B-A17B | 94 |
| Xiaomi | MiMo-V2-Pro | 94 |
| Anthropic | Claude Sonnet 4.5 | 94 |
| Moonshot AI | Kimi K2 Thinking | 94 |
| ByteDance | Doubao-Seed-2.0-lite | 93 |
| MiniMax | MiniMax-M2.5 | 93 |
| Xiaomi | MiMo-V2-Omni | 93 |
| ByteDance | Doubao-Seed-2.0-pro | 93 |
| Alibaba | Qwen3.5-Plus-2026-02-15 | 93 |
| Z.ai | GLM-5 | 91 |
| StepFun | Step 3.5 Flash | 91 |
| DeepSeek | DeepSeek-V3.2(Non-thinking) | 91 |
| Baidu | ERNIE-5.0-Thinking-Preview | 91 |
| Moonshot AI | Kimi K2.5 | 90 |
| DeepSeek | DeepSeek-V3.2(Thinking) | 89 |
| Anthropic | Claude Opus 4.5 | 87 |
| OpenAI | GPT-5.4 Nano | 87 |
| Google | Gemini 3.1 Pro Preview | 83 |
| Google | Gemini 2.5 Pro | 83 |
| Alibaba | Qwen3.5-27B | 83 |
| Anthropic | Claude Haiku 4.5 | 82 |
| Mistral AI | Mistral Large 3 2512 | 82 |
| Anthropic | Claude Sonnet 4.6 | 82 |
| xAI | Grok 4.1 Fast | 80 |
| OpenAI | GPT-5.4 Mini | 78 |
| Alibaba | Qwen3.5-122B-A10B | 77 |
| OpenAI | gpt-oss-20b | 77 |
| Google | Gemini 3 Flash Preview | 76 |
| Nvidia | Nemotron 3 Nano | 75 |
| Alibaba | Qwen3-Coder-Next | 73 |
| Amazon | Nova 2 Lite | 73 |
| xAI | Grok 4.20 Beta | 72 |
| Xiaomi | MiMo-V2-Flash | 69 |
| OpenAI | gpt-oss-120b | 67 |
| Nvidia | Nemotron 3 Super | 58 |