Logic

Logic puzzles and deductive reasoning

5 tasks · 18 models tested · 90 results

Logic puzzle

text
| Model | Tokens | Source code | Time |
| --- | --- | --- | --- |
| anthropic claude-haiku-4-5-20251001 | 444 | 1.2 KB | 3.6 s |
| anthropic claude-opus-4-6 | 500 | 1.3 KB | 9.9 s |
| anthropic claude-opus-4-7 | 653 | 1.1 KB | 9.0 s |
| anthropic claude-sonnet-4-6 | 453 | 1.1 KB | 7.8 s |
| cohere command-r-08-2024 | 312 | 837 B | 8.7 s |
| google gemini-flash-latest | 417 | 1.1 KB | 7.1 s |
| google gemini-flash-lite-latest | 381 | 1.1 KB | 9.9 s |
| kimi moonshot-v1-128k | 401 | 1.2 KB | 7.0 s |
| mistral mistral-large-latest | 441 | 1.3 KB | 7.9 s |
| mistral mistral-small-latest | 324 | 886 B | 2.7 s |
| mistral mistral-tiny-latest | 342 | 959 B | 2.0 s |
| openai gpt-4o-mini | 369 | 1.0 KB | 6.4 s |
| openai gpt-5.4-nano | 353 | 1000 B | 3.6 s |
| openai gpt-5.5 | 338 | 941 B | 9.6 s |
| openai gpt-5.5-pro | 345 | 971 B | 68.7 s |
| productivia matania-latest | 492 | 1.5 KB | 3.2 s |
| xai grok-4-1-fast-non-reasoning | 365 | 1.0 KB | 7.7 s |
| xai grok-4-1-fast-reasoning | 333 | 922 B | 50.6 s |

Paradox analysis

text
| Model | Tokens | Source code | Time |
| --- | --- | --- | --- |
| anthropic claude-haiku-4-5-20251001 | 307 | 1.1 KB | 3.6 s |
| anthropic claude-opus-4-6 | 320 | 1.0 KB | 8.4 s |
| anthropic claude-opus-4-7 | 482 | 1.1 KB | 8.4 s |
| anthropic claude-sonnet-4-6 | 315 | 1.0 KB | 6.9 s |
| cohere command-r-08-2024 | 361 | 1.2 KB | 10.6 s |
| google gemini-flash-latest | 263 | 1.0 KB | 6.1 s |
| google gemini-flash-lite-latest | 269 | 1.1 KB | 2.0 s |
| kimi moonshot-v1-128k | 353 | 1.1 KB | 4.9 s |
| mistral mistral-large-latest | 333 | 1.0 KB | 5.7 s |
| mistral mistral-small-latest | 486 | 1.6 KB | 3.5 s |
| mistral mistral-tiny-latest | 305 | 958 B | 2.2 s |
| openai gpt-4o-mini | 357 | 1.1 KB | 5.6 s |
| openai gpt-5.4-nano | 303 | 950 B | 2.8 s |
| openai gpt-5.5 | 308 | 968 B | 6.0 s |
| openai gpt-5.5-pro | 303 | 951 B | 20.3 s |
| productivia matania-latest | 306 | 960 B | 1.7 s |
| xai grok-4-1-fast-non-reasoning | 419 | 1.4 KB | 5.3 s |
| xai grok-4-1-fast-reasoning | 273 | 828 B | 19.0 s |

Algorithm design

text
| Model | Tokens | Source code | Time |
| --- | --- | --- | --- |
| anthropic claude-haiku-4-5-20251001 | 265 | 184 B | 4.6 s |
| anthropic claude-opus-4-6 | 267 | 257 B | 3.5 s |
| anthropic claude-opus-4-7 | 366 | 209 B | 4.0 s |
| anthropic claude-sonnet-4-6 | 308 | 167 B | 4.4 s |
| cohere command-r-08-2024 | 234 | 168 B | 38.7 s |
| google gemini-flash-latest | 231 | 593 B | 3.0 s |
| google gemini-flash-lite-latest | 227 | 583 B | 1.9 s |
| kimi moonshot-v1-128k | 230 | 638 B | 3.8 s |
| mistral mistral-large-latest | 254 | 734 B | 3.0 s |
| mistral mistral-small-latest | 218 | 589 B | 1.5 s |
| mistral mistral-tiny-latest | 188 | 194 B | 1.2 s |
| openai gpt-4o-mini | 222 | 606 B | 4.6 s |
| openai gpt-5.4-nano | 240 | 676 B | 2.3 s |
| openai gpt-5.5 | 211 | 562 B | 7.6 s |
| openai gpt-5.5-pro | 196 | 500 B | 9.7 s |
| productivia matania-latest | 233 | 648 B | 1.8 s |
| xai grok-4-1-fast-non-reasoning | 241 | 236 B | 3.2 s |
| xai grok-4-1-fast-reasoning | 217 | 183 B | 7.5 s |

Ethical dilemma

text
| Model | Tokens | Source code | Time |
| --- | --- | --- | --- |
| anthropic claude-haiku-4-5-20251001 | 304 | 1.0 KB | 5.9 s |
| anthropic claude-opus-4-6 | 322 | 1.1 KB | 7.7 s |
| anthropic claude-opus-4-7 | 527 | 1.1 KB | 8.8 s |
| anthropic claude-sonnet-4-6 | 312 | 956 B | 6.6 s |
| cohere command-r-08-2024 | 297 | 871 B | 7.6 s |
| google gemini-flash-latest | 315 | 1.3 KB | 5.8 s |
| google gemini-flash-lite-latest | 278 | 1.1 KB | 1.8 s |
| kimi moonshot-v1-128k | 415 | 1.3 KB | 6.3 s |
| mistral mistral-large-latest | 398 | 1.2 KB | 7.0 s |
| mistral mistral-small-latest | 232 | 610 B | 1.7 s |
| mistral mistral-tiny-latest | 339 | 1.0 KB | 2.4 s |
| openai gpt-4o-mini | 339 | 1.0 KB | 3.2 s |
| openai gpt-5.4-nano | 381 | 1.2 KB | 2.5 s |
| openai gpt-5.5 | 315 | 942 B | 5.2 s |
| openai gpt-5.5-pro | 325 | 981 B | 18.5 s |
| productivia matania-latest | 335 | 1021 B | 1.6 s |
| xai grok-4-1-fast-non-reasoning | 249 | 679 B | 4.3 s |
| xai grok-4-1-fast-reasoning | 222 | 571 B | 6.4 s |

Complex deduction

text
| Model | Tokens | Source code | Time |
| --- | --- | --- | --- |
| anthropic claude-haiku-4-5-20251001 | 393 | 1012 B | 4.0 s |
| anthropic claude-opus-4-6 | 376 | 879 B | 9.8 s |
| anthropic claude-opus-4-7 | 556 | 1.1 KB | 8.3 s |
| anthropic claude-sonnet-4-6 | 387 | 1000 B | 6.8 s |
| cohere command-r-08-2024 | 294 | 811 B | 9.5 s |
| google gemini-flash-latest | 366 | 1.1 KB | 4.7 s |
| google gemini-flash-lite-latest | 347 | 954 B | 2.1 s |
| kimi moonshot-v1-128k | 364 | 1.1 KB | 6.0 s |
| mistral mistral-large-latest | 338 | 985 B | 5.2 s |
| mistral mistral-small-latest | 261 | 677 B | 2.8 s |
| mistral mistral-tiny-latest | 206 | 459 B | 1.0 s |
| openai gpt-4o-mini | 298 | 826 B | 6.4 s |
| openai gpt-5.4-nano | 359 | 1.0 KB | 2.9 s |
| openai gpt-5.5 | 333 | 964 B | 11.4 s |
| openai gpt-5.5-pro | 304 | 851 B | 101.1 s |
| productivia matania-latest | 340 | 994 B | 2.3 s |
| xai grok-4-1-fast-non-reasoning | 314 | 891 B | 5.7 s |
| xai grok-4-1-fast-reasoning | 301 | 839 B | 7.6 s |
Code