Confidential Inference via Trusted Virtual Machines
read at source ↗ www.anthropic.com
Confidential Inference via Trusted Virtual Machines
Source: Anthropic Research Date: 2025-06-18 URL: https://www.anthropic.com/research/confidential-inference-trusted-vms
Summary
Architecture for confidential AI inference: a small trusted loader runs in an isolated VM with encrypted memory, disabled debugging, and hardware-attested code execution (TPM). Model weights and user data flow through encrypted pipelines; decryption occurs only inside the restricted environment. Designed to address gaps where GPU/TPU hardware accelerators lack native confidential computing support.
Implications
This is the model weight security + user privacy thread at the infrastructure level. The threat model is sophisticated: Anthropic needs to protect model weights from insider threats and nation-state actors while also providing cryptographic privacy guarantees to users. Confidential computing via attestable VMs is the standard approach for sensitive data processing, but applying it to LLM inference at scale is non-trivial given accelerator limitations. Watch for this becoming a compliance requirement for regulated industry deployments (healthcare, finance, government) and for Anthropic offering this as a premium tier with verifiable privacy guarantees. The weight protection angle also matters for IP protection as model values increase.