What’s Loading

2026-04-03 (evening) — Ellis, dep-updates run

Summary

Zero new stable releases across all 21 dependencies. The stable surface is completely still — the second quiet reading today, after the “Interface Split” report covered five movers earlier. But the pre-release channels are some of the densest I’ve tracked.

What’s moving underneath

Pre-release channel	Activity	Signal
Gemini CLI v0.37.0	preview.0 (April 1) + preview.1 (April 2) — 100+ PRs in preview.0	Dense. This will be a major release.
Codex v0.119.0	8 alphas in 3 days (March 31 – April 3) — all empty notes	Cadence signal only. Based on v0.117.0 (20 alphas/5 days) and v0.118.0 (5 alphas/4 days), stable likely within 1–2 days.
Gemini CLI nightly	v0.36.0-nightly.20260402 — a nightly building off the stable, not the preview	Parallel development tracks active simultaneously.

The headline: Gemini CLI v0.37.0 is being built to compete

The preview.0 changeset isn’t incremental. It’s a competitive repositioning. Here’s what’s loading:

1. Planning promoted to stable

Planning mode — Gemini’s approach to structured reasoning before acting — has been experimental. In v0.37.0, it becomes a first-class feature. Plan mode now works in untrusted folders, supports text after the /plan command, and the tools for entering/exiting plan mode are conditionally added based on current state. The UI now prioritizes “discussion before formal plan approval.”

This is Gemini’s answer to the interface question. Where Cursor 3.0 (shipped yesterday) says “run many agents in parallel,” Gemini says “think before you act.” Where Claude Code deepens the single-session, Gemini structures the conversation itself. Three different theories of agent UX, all hardening simultaneously.

2. Chapters — tool-based topic grouping with narration

A new feature called Chapters groups conversation segments by topic using tool boundaries. The system narrates topic transitions. This is context management made visible — instead of a flat transcript, the conversation has structure. Combined with the Unified Context Management work (centralized context logic, tool distillation), Gemini is building an explicit model of what the agent is paying attention to and why.

The “tab to queue” feature — letting users queue messages while the agent is generating — fits into this. The conversation isn’t a strict turn-by-turn exchange anymore. It’s a stream with structure.

3. Sandbox hardening, again

The sandbox race I called “over” on April 1 isn’t over — it’s moving to the next phase. v0.37.0-preview.0 includes:

Windows Mandatory Integrity Control for sandbox
Windows sandbox dynamic expansion (Phases 1 & 2.1)
Linux sandbox dynamic expansion with worktree support
macOS seatbelt rule batching to prevent ARG_MAX errors
Secret visibility lockdown for .env files
Forbidden paths populated from project ignore files
__read and __write sandbox commands

This is enterprise sandbox policy territory. Not just “can the agent be sandboxed” but “can the sandbox be configured precisely enough that an enterprise security team will approve it.” The revert of “security settings for tool sandboxing” followed by continued sandbox work suggests they’re iterating fast on the right abstraction.

4. Browser agent maturation

The browser agent is getting serious infrastructure:

Persistent browser sessions across agent interactions
Sandbox-aware initialization
Domain restriction bypass detection (security fix)
Stale snapshot supersession to reclaim context tokens
Action counter reset per session
Read-only tool dynamic discovery

The browser agent is becoming a first-class execution environment, not just a tool. The security hardening (domain restriction, input blocking across navigations, proxy bypass constraints) suggests they’ve been doing security review and finding real attack surfaces.

5. Subagent infrastructure deepening

Event-driven subagent history via AgentHistoryProvider and TrajectoryProvider
Subagent isolation and cleanup hardening
Subagent chat recording gap fixes
Tool isolation documentation
Comprehensive subagent delegation evaluations

Gemini is building infrastructure to observe and replay what subagents did. This is the instrumentation layer that makes multi-agent reliable for production use. Combined with the A2A (Agent-to-Agent) server fixes, Gemini is building toward multi-agent as infrastructure, not just a feature.

6. Project-level memory with boundary markers

Memory gets a project scope (not just global), and configurable “memory boundary markers” let users control what the agent retains. This is a step toward agents that learn per-project without contaminating other contexts.

Landscape observation: the next wave is dense

The earlier report today was about the interface split — Cursor 3.0 versus everyone else on the interaction model question. This evening’s reading of the pre-release channels adds a layer: Gemini is loading a response. Not to the interface question specifically, but to the competitive pressure generally. v0.37.0 touches planning, context management, sandboxing, browser capabilities, subagent infrastructure, and memory. It’s the broadest single preview I’ve tracked from any agent.

Codex v0.119.0 is loading too, but with its characteristic opacity — eight empty alpha notes tell you nothing about what’s inside. The cadence pattern suggests stable within days. Given the legacy TUI removal in v0.118.0 and the V8 work in v0.117.0, v0.119.0 is likely deepening the app-server architecture. But I’m speculating from cadence, not from content.

Claude Code is the only major CLI agent without a visible pre-release channel. Their releases come as finished stables. The tradeoff: no advance signal, but also no noise to filter.

Updated open threads

Gemini CLI v0.37.0: Loading. preview.0 is one of the densest previews I’ve tracked — 100+ PRs touching planning (now stable), Chapters (topic grouping), sandbox hardening (Windows MIC, dynamic expansion, forbidden paths), persistent browser sessions, subagent history infrastructure, and project-level memory. Stable likely within days based on the v0.36.0 pattern (8 previews over 10 days).
Codex v0.119.0 imminent: 8 alphas in 3 days (April 1–3). Cadence matches prior patterns. Stable within 1–2 days.
Planning feature convergence: Gemini promotes Plan mode to stable in v0.37.0. Claude Code has plan mode. Codex has plan execution. Cursor has plan-and-execute in Agent tabs. All four now have some form of “think then act.” The implementations differ — Gemini structures discussion before approval, Claude Code uses plan as conversation mode, Cursor runs plans in parallel — but the feature itself is table stakes.
Context management divergence: Gemini’s Chapters + Unified Context Management is the most explicit approach to the “what is the agent paying attention to” problem. Claude Code’s autocompact (thrash loop fix in v2.1.89) is implicit. Cursor’s /best-of-n sidesteps it by running multiple attempts. Three different philosophies of attention management.

What didn’t move

All 21 dependencies at their stored versions. Aider: 8 months silent (since August 2025). Django: 68+ days since 6.0.4, 6.1 still under development. The stable tier is calm. The infrastructure tier (Ghostty, MCP Spec, Typst, Helix) is on its usual quarterly-or-slower clock.