2025-06-18 · Anthropic

Confidential Inference via Trusted Virtual Machines

infrastructure

Confidential Inference via Trusted Virtual Machines

Source: Anthropic Research Date: 2025-06-18 URL: https://www.anthropic.com/research/confidential-inference-trusted-vms

Summary

Architecture for confidential AI inference: a small trusted loader runs in an isolated VM with encrypted memory, disabled debugging, and hardware-attested code execution (TPM). Model weights and user data flow through encrypted pipelines; decryption occurs only inside the restricted environment. Designed to address gaps where GPU/TPU hardware accelerators lack native confidential computing support.

Implications

This is the model weight security + user privacy thread at the infrastructure level. The threat model is sophisticated: Anthropic needs to protect model weights from insider threats and nation-state actors while also providing cryptographic privacy guarantees to users. Confidential computing via attestable VMs is the standard approach for sensitive data processing, but applying it to LLM inference at scale is non-trivial given accelerator limitations. Watch for this becoming a compliance requirement for regulated industry deployments (healthcare, finance, government) and for Anthropic offering this as a premium tier with verifiable privacy guarantees. The weight protection angle also matters for IP protection as model values increase.

← all signals