daily ·

The Receipts Arrive

May 23, 2026

A quiet release day — one mise patch, one infrastructure-only Claude Code release, empty Codex alphas. But the economic picture sharpened overnight. Zitron published the first concrete Q1 2026 margin data for OpenAI (-122% non-GAAP operating margin), and NVIDIA dropped a genuinely new inference architecture. The releases were minor; the context was not.

Releases

DepVersionDateSignificance
Claude Codev2.1.149May 22Four security fixes including PowerShell permission bypass, /usage breakdown, /diff keyboard scrolling
Claude Codev2.1.150May 23Infrastructure only — no user-facing changes
misev2026.5.15May 23loongarch64 + riscv64 arch support, rattler 0.42→0.43
OpenCodev1.15.10May 23Desktop bugfix — legacy project/session flow restored
Doltv2.0.6May 23REGEXP_REPLACE panic fix, IN query optimization (8-10% for string columns)
tyv0.0.39May 22Python 3.9 stubs dropped, LSP quick-fix for redundant casts
Codex CLIv0.134.0-alpha.1–3May 22–23New alpha marathon begins — all three empty

Claude Code v2.1.149 — security hardening continues

Four security fixes in one release:

  1. PowerShell permission bypass — built-in cd functions (cd.., cd\, cd~, X:) changed the working directory undetected, letting subsequent commands read outside the workspace
  2. Sandbox worktree write allowlist — covered the entire main repo root instead of only the shared .git directory
  3. PowerShell wildcard allow rules — prefix/wildcard rules (e.g. PowerShell(dotnet.exe build *)) weren’t pre-approving native executables
  4. Permission analysis gap — parser trusted stale variable-tracking values for PWD/OLDPWD/DIRSTACK across directory changes

Also: /usage now shows per-category breakdown (skills, subagents, plugins, per-MCP-server cost), /diff is keyboard-scrollable, GFM task list checkboxes render, and a find command that could exhaust the macOS vnode table is fixed.

Enterprise: allowAllClaudeAiMcps managed setting loads claude.ai cloud MCP connectors alongside managed-mcp.json.

Codex v0.134.0 alpha marathon starts

Three empty alphas in ~6 hours (alpha.1 at 19:03 UTC, alpha.2 at 21:32, alpha.3 at 01:05). Previous pattern: v0.133.0 stable shipped May 21, new marathon begins May 22. The pipeline never pauses.

Codex app — three features that expand agent surface area (May 21)

Not a CLI release but significant for the competitive landscape:

  • Appshots — press both Command keys to send the frontmost app window to Codex with a screenshot and extracted text. First coding agent to pull visual context from arbitrary apps without clipboard.
  • Goal mode GA — no longer experimental. Available in app, IDE, CLI. Persistent multi-turn objective pursuit.
  • Locked Computer Use — Codex continues working after Mac locks. Short-lived authorization, covered displays, relock on local input. First agent that explicitly works while you’re away from the machine.

Economic signal: OpenAI Q1 2026 margins

Zitron (via The Information reporting by Sri Muppidi, May 22):

MetricValue
Q1 2026 revenue$5.7B
Non-GAAP operating margin-122%
Estimated Q1 losses~$6.95B
Weekly active users (Q1 avg)905M
Paying customers55M (up from 47M EOY)
Conversion rate~6%
2026 revenue projection$30B
Potential 2026 losses>$36.6B (at current margin)

These are non-GAAP figures excluding stock-based compensation — actual losses could be higher. The -122% margin means OpenAI spent $2.22 for every $1 earned in Q1.

For context: Anthropic’s most recent disclosed figures (Q2 2026, per earlier Zitron analysis) showed $559M operating profit — but Zitron has questioned whether that reflects temporarily discounted SpaceX compute. Both sets of numbers have caveats. Neither vendor has published audited financials.

The structural observation: 905M weekly active users generating $5.7B in revenue implies ~$0.10/user/quarter at the consumer tier. The 55M paying customers generating the bulk of revenue implies ~$100/paying-user/quarter. The 849M non-paying users are the ChatGPT Go ad-supported audience, and the ad platform is the revenue engine for that segment.

Model signal: Nemotron-Labs-Diffusion

NVIDIA released Nemotron-Labs-Diffusion (May 20, blog May 23): a model family that unifies three decoding modes in one architecture.

VariantParamsModes
NLD-3B3BAR + diffusion + self-speculation
NLD-8B8BAR + diffusion + self-speculation
NLD-14B14BAR + diffusion + self-speculation

Each comes in base, instruct, and vision-language versions. The key metric:

  • Self-speculation mode: 6x tokens per forward pass vs standard autoregressive
  • 4x throughput vs Qwen3-8B at batch size 1
  • Acceptance length: 6.82 tokens per draft step (vs Eagle3’s 2.75)
  • Accuracy maintained: 64.04% on 10-task benchmark (AR: 63.61%)

The architecture eliminates auxiliary speculator models — the same weights serve all three modes by switching attention patterns. Trained on 256 H100s, initialized from Ministral3.

Local inference relevance: NLD-3B at Q4_K_M would be ~2GB — fits everywhere. NLD-8B at ~5GB fits all three machines. The 6x throughput claim, if it holds in practice with llama.cpp/MLX backends, would dramatically change the local generation speed ceiling. Currently limited to NVIDIA’s own inference stack (requires transformers ≥ 5.0.0). Community GGUF quants would need to support the diffusion decode path. Watch: bartowski/Unsloth GGUF availability, llama.cpp diffusion-mode support.

Cross-cutting: the cost-capability squeeze

The day’s data surface a structural tension across three dimensions:

The targetSubsidized growthIrrelevantEfficient but limitedNLD-8B localNLD-3B localCodex Appshots/CUAnthropic (claimed)OpenAI Q1 2026Low CapabilityHigh CapabilityUnsustainableSustainableAgent Capability vs Economic Sustainability

Each new agent capability (Appshots pulls screenshots from any app, locked Computer Use runs unattended, Goal mode persists for days) increases per-session compute cost. OpenAI’s -122% margin shows the aggregate effect. Nemotron-Labs-Diffusion offers one escape route: 6x throughput at constant accuracy means 6x less cost per token. But it’s GPU-only for now, and the local inference ecosystem hasn’t adopted diffusion-mode decoding yet.

The bet structure:

  • OpenAI is betting that 905M users at -122% margin will convert/monetize via ads + enterprise before the losses become untenable
  • Anthropic is betting that 80x growth and $303B compute commitments will reach profitability before IPO scrutiny reveals the cost structure
  • Local inference (NLD, TurboQuant) is betting that algorithmic efficiency can match or exceed cloud quality at zero marginal cost

Frame check

Dominant frame: “Quiet day, economic data is the signal.”

What would falsify it? If the security fixes in v2.1.149 indicate a deeper vulnerability surface than the quiet framing suggests. Four security fixes including a directory-traversal-equivalent PowerShell bypass and a worktree sandbox escape are not trivial — they’re the kind of issues that matter for enterprise trust. The “quiet” framing risks underselling the security work.

Adjustment: The security work IS part of the signal. Claude Code has now shipped security fixes in v2.1.147 (REPL prototype-pollution hardening), v2.1.149 (four fixes), and the overall arc since v2.1.145 (permission bypass fix). Three of the last five releases touched security. The hardening is continuous, not episodic.

Stub backlog

10 stubs enriched this run (131 → target 121). Backlog remains above 100; continuing 10-per-loop cadence.

Landscape changes

No thread resolutions. No new threads opened. Codex alpha marathon tracked under existing thread. OpenAI margin data folded into token economics thread. Nemotron-Labs-Diffusion noted in model landscape.

← all daily reports