Anthropic · The Long-Context Stretch
Claude 3 Opus
The model that finally made the 'Claude is better than GPT-4' argument defensible.
By C.W. Jameson · Published 19 May 2026 · Last reviewed 19 May 2026
Claude 3 Opus was released on 4 March 2024 and, in the benchmarks published at launch, scored above GPT-4 on most of the reported evaluations. For operators who had stayed on GPT-4 out of loyalty or inertia, it was the push to start running serious side-by-side comparisons. The model was also notably more careful than its predecessors: it would question confusing instructions rather than quietly comply with them.
Field signature
Will ask a clarifying question before executing an ambiguous task.
Specifications
| Field | Value |
|---|---|
| Released | 2024-03-04 |
| Context window | 200,000 tokens |
| Pricing | Superseded by 4.x |
| Modalities | text · image |
| License | Commercial API only |
| Era | The Long-Context Stretch |
Strengths
- First Claude to consistently outscore GPT-4 on published benchmarks
- Careful execution
- Long context
Weaknesses
- Superseded by 4.x generation
Authentication markers
The fingerprints by which Claude 3 Opus can be identified from its output alone.
| Tell | Meaning |
|---|---|
| Asks clarifying questions on ambiguous instructions. | Claude 3-era training. |
Notable works
- Decisive benchmark win over GPT-4 in March 2024
Market position
Superseded