Analysis·8 min
Llama 4: The New Open-Weights Baseline
By C.W. Jameson · Published 20 October 2025 · Last reviewed 20 October 2025
Meta shipped a model that can be self-hosted on eight H100s and matches GPT-4o on most benchmarks. The inference industry responded within two weeks.
An analysis of Llama 4's MoE architecture, benchmark performance, and what it means for the open-weights ecosystem.
Related dispatches