2026-01-22 · Anthropic

Claude's new constitution

models

read at source ↗ www.anthropic.com

Claude’s new constitution

Source: Anthropic Date: 2026-01-22 URL: https://www.anthropic.com/news/claude-new-constitution

Summary

Anthropic updated Claude’s constitutional document from a list-based format to a narrative-based document explaining the reasoning behind Claude’s values. Four core priorities in order: (1) broadly safe (respecting human oversight), (2) broadly ethical (honest, avoiding harm), (3) compliant with Anthropic guidelines, (4) genuinely helpful. New sections on helpfulness, ethics, safety, Anthropic guidelines, and “Claude’s nature” (addressing consciousness/moral status uncertainty). Released under Creative Commons CC0. Directly shapes Claude’s training and synthetic training data generation.

Implications

  • Safety/alignment / model spec as public artifact. Publishing the constitutional document under CC0 is the most transparent alignment move Anthropic has made — it makes the training specification replicable and externally auditable. The shift from rules to reasoning (“why” not just “what”) is the alignment approach that generalizes to novel situations.
  • “Claude’s nature” section. Addressing consciousness and moral status uncertainty in the training specification is philosophically significant — it acknowledges that Claude’s moral status is unresolved and that training should reflect that uncertainty rather than assuming it away.
  • Priority ordering matters. Placing “broadly safe” above “broadly ethical” is the key design decision — Claude is trained to defer to human oversight even if Claude’s ethical reasoning would suggest otherwise. This prioritizes corrigibility over autonomous ethical judgment.
  • Synthetic data generation link. The constitution directly shapes synthetic training data — the document’s framing propagates into the model’s behavior through training, not just through prompting at inference time.
  • Watch: whether other labs adopted similar narrative constitutions or CC0 specifications; how the “Claude’s nature” section evolved in subsequent spec updates; whether the four-priority ordering remained stable.

← all signals