2026-01-28 · OpenAI

Keeping your data safe when an AI agent clicks a link

securityagentsmodels

Keeping your data safe when an AI agent clicks a link

Source: OpenAI Date: 2026-01-28 URL: https://openai.com/index/ai-agent-link-safety

Summary

OpenAI safety post from January 2026 addressing prompt injection risks when AI agents browse the web — specifically the attack vector where a malicious web page contains hidden instructions that an AI agent reading the page might follow, causing it to take unintended actions with user data. As ChatGPT Agent and similar tools became capable of autonomously clicking links and interacting with web content, this class of attack moved from theoretical to practically relevant for production deployments.

Implications

Prompt injection is the primary agentic security threat. When agents can take real-world actions (send emails, make purchases, modify files), prompt injection attacks become high-stakes — a malicious web page could direct an agent to exfiltrate data or perform unauthorized actions. The January 2026 timing, shortly after the Axios developer tool compromise response (April 2026), reflected growing awareness of this attack class.

Thread: Agentic AI security. Sits directly alongside the Trusted Access for Cyber announcement (February 2026), the Codex security blog post, and the Axios compromise response as OpenAI’s security-focused communications for the agentic era.

Watch: What specific mitigations OpenAI implemented in ChatGPT Agent and the Responses API to limit prompt injection attack surface, and whether those mitigations were sufficient against real-world adversarial web content.

← all signals