The Domesday Book of KWJ · AI


DeepSeek R1

The open-weights reasoning model that sent a shockwave through the industry, trained at a fraction of frontier-lab costs.

By C.W. Jameson · Published 19 May 2026 · Last reviewed 19 May 2026

R1's release was the first time an open-weights model matched frontier models on reasoning benchmarks at a fraction of the training cost. The technical report described a training pipeline simple enough that a hundred labs reproduced it within six weeks. Subsequent open-weights releases treat R1's recipe as the floor, and the model itself is now used primarily as a backbone for fine-tuning rather than for direct deployment.

Field signature

Returns long <think> blocks before answers. Reasoning visible.
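Because the reasoning arrives inline, an operator usually wants to separate it from the final answer before showing output to a user. The sketch below is one minimal way to do that, assuming the completion is a plain string with at most one <think>…</think> block ahead of the answer; the function name split_reasoning is hypothetical and not part of any DeepSeek tooling.

```python
import re

# Matches one <think>...</think> block; DOTALL lets the reasoning span lines.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(raw: str) -> tuple[str, str]:
    """Return (reasoning, answer) from a raw completion string.

    If no <think> block is found, reasoning is empty and the whole
    text is treated as the answer.
    """
    match = THINK_BLOCK.search(raw)
    if match is None:
        return "", raw.strip()
    reasoning = match.group(1).strip()
    answer = raw[match.end():].strip()  # everything after </think>
    return reasoning, answer


sample = "<think>\n2 + 2 is 4.\n</think>\nThe answer is 4."
reasoning, answer = split_reasoning(sample)
print(answer)  # The answer is 4.
```

Some serving stacks may strip the tags or return the reasoning in a separate field, in which case this step is unnecessary.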

Specifications

Released: 2025-01-20
Context window: 64,000 tokens
Pricing: Free if self-hosted; ~$0.55/$2.19 per million tokens on inference services
Modalities: text
License: MIT (with use-case carve-outs)
Era: The Reasoning-Model Era
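To make the hosted pricing concrete, the arithmetic can be sketched as follows. This assumes the two figures in the pricing row are the input rate and the output rate respectively, which the entry does not state, and that tokens inside <think> blocks are billed as output. The token counts are hypothetical.

```python
# Assumed rates in USD per million tokens, taken from the pricing row.
# Reading the first figure as input and the second as output is an assumption.
INPUT_USD_PER_MTOK = 0.55
OUTPUT_USD_PER_MTOK = 2.19


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD under the assumed rates."""
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 8,000 generated tokens
# (reasoning tokens are assumed to count toward the output total).
cost = estimate_cost_usd(2_000, 8_000)
print(f"${cost:.4f}")  # $0.0186
```

Long reasoning traces mean the output term tends to dominate, so the output rate matters more than the input rate for budgeting.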

Strengths

  • Cost
  • Open weights
  • Competitive reasoning benchmarks

Weaknesses

  • Older underlying base model (V3)
  • Refusal behavior is unpredictable
  • China-origin data filtering visible on some prompts

Authentication markers

The fingerprints by which DeepSeek R1 can be identified from its output alone.

Tell | Meaning
<think> blocks before final answer. | DeepSeek R1 family.
Refusal patterns include 'I am sorry, I cannot…' verbatim. | DeepSeek training artefact.
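The two tells above can be checked mechanically. The sketch below is a heuristic only, assuming the output is available as a raw string with the <think> tags intact; the function name r1_tells and the result keys are hypothetical. A positive result is a hint rather than proof of provenance, since other models may produce the same patterns.

```python
import re


def r1_tells(output: str) -> dict[str, bool]:
    """Check a completion for the two tells listed in the table above."""
    text = output.lstrip()
    return {
        # Tell 1: a <think>...</think> block opens the response.
        "think_block_first": bool(
            re.match(r"<think>.*?</think>", text, re.DOTALL)
        ),
        # Tell 2: the quoted refusal phrase appears verbatim.
        "verbatim_refusal": "I am sorry, I cannot" in output,
    }


print(r1_tells("<think>Check the units.</think>The result is 12 m."))
```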

Notable works

  • Triggering the open-weights reasoning wave of Q1 2025

Market position

$0.55 - $2.19 per million tokens via Together / Fireworks / Groq
