kwj.ai · acquisition inquiries from >$999view prospectus →
The Domesday Book ofKWJ · AI

Volume I · The Directory

Agents and models, in current use

Every entry below is something an operator might reach for in 2026. Each profile carries pricing, context, license, and the linguistic fingerprints by which the model can be identified from its output alone.

Frontier models

Claude Opus 4.7

Anthropic · X

Anthropic's flagship at the time of this entry: extended thinking on, 1M-token context, the most patient of the frontier models.

Claude Sonnet 4.6

Anthropic · X

The workhorse. Faster than Opus, smarter than Haiku, the model most production deployments end up on.

Claude Haiku 4.5

Anthropic · X

The fast one. Used for sub-agents, high-volume classification, and the parts of an agent loop where capability matters less than throughput.

GPT-5

OpenAI · IX

OpenAI's reasoning-first flagship. Native chain-of-thought, three reasoning-effort tiers, the highest published benchmark scores at release.

GPT-4.5

OpenAI · VII

The last non-reasoning OpenAI flagship. Better at vibes than reasoning. Quietly deprecated in favor of GPT-5.

Gemini 2.5 Pro

Google DeepMind · IX

Google's reasoning flagship. Two-million-token context, native multimodal, the only frontier model that reads PDFs without an extraction pre-pass.

Mistral Large 3

Mistral · X

The European frontier model. Codes well, refuses politely, and keeps your data in the EU.

Grok 4

xAI · IX

Elon's reasoning flagship. Native Twitter/X integration, willing to discuss what other models won't.

OpenAI o3

OpenAI · IX

OpenAI's peak reasoning model before GPT-5. AIME, ARC-AGI, and SWE-bench records at release.

Claude 3.5 Sonnet

Anthropic · VII

The model that made Claude the coding default. Coding benchmarks beat GPT-4o at launch.

Gemini Flash 2.0

Google DeepMind · X

Google's workhorse: fast, cheap, multimodal. The Sonnet of the Gemini family.

Command R+

Cohere · VII

Enterprise RAG specialist. Built for retrieval pipelines, not chat.

Claude 2

Anthropic · IV

Anthropic's first publicly competitive release. 100K context when GPT-4 had 8K.

Gemini 1.5 Pro

Google DeepMind · VII

The model that proved 1M-token context was practical. Video and audio native.

Claude 3 Opus

Anthropic · VII

The model that finally made the 'Claude is better than GPT-4' argument defensible.

Gemini Ultra 1.0

Google DeepMind · VII

Google's first frontier-class model, released to counter GPT-4 and Claude 3.

GPT-4o

OpenAI · V

The omni model. Text, image, audio natively in one system. Speed doubled vs. GPT-4.

GPT-4 Turbo

OpenAI · VII

128K context, knowledge cutoff updated, cost dropped 3×. The model that made GPT-4 accessible.

Open-weights models

Coding agents

General-purpose agent frameworks

Computer-use agents

Specialty agents