The Domesday Book of KWJ · AI

Anthropic · The Long-Context Stretch

Claude 3 Opus

The model that finally made the 'Claude is better than GPT-4' argument defensible.

By C.W. Jameson · Published 19 May 2026 · Last reviewed 19 May 2026

Claude 3 Opus landed in March 2024 and scored above GPT-4 on most of the benchmarks published at launch. For operators who had stayed on GPT-4 out of loyalty or inertia, it was the decisive push to run serious comparisons of their own. The model was also notably more careful than its predecessors: it would question confusing instructions rather than quietly comply with them.

Field signature

Will ask a clarifying question before executing an ambiguous task.

Specifications

Released: 2024-03-04
Context window: 200,000 tokens
Pricing: Superseded by 4.x
Modalities: text · image
License: Commercial API only
Era: The Long-Context Stretch

Strengths

  • First Claude to consistently outscore GPT-4 on benchmarks
  • Careful execution
  • Long context

Weaknesses

  • Superseded by 4.x generation

Authentication markers

The fingerprints by which Claude 3 Opus can be identified from its output alone.

Tell: Asks clarifying questions on ambiguous instructions.
Meaning: Claude 3-era training.

Notable works

  • Decisive benchmark win over GPT-4 in March 2024

Market position

Superseded

Primary sources

  1. Anthropic: Claude 3 Opus
