Codex CLI 0.125.0 for reviewable agent loops
Codex CLI 0.125.0 improves app-server plumbing, permission state, and JSON output for reviewable workflows.

Codex CLI 0.125.0
The situation
Codex CLI 0.125.0 is a workflow release. The changelog points to changes that matter when teams run Codex against real codebases: app-server plumbing, permission state that survives across sessions, better JSON output for automation, and fewer failure modes in review and execution loops.
For teams using Codex in a production coding loop, the pattern stays the same: read instructions from the repo, make a bounded change, verify it, and hand back a reviewable diff. If your team already uses Codex CLI workflows, this release mostly smooths the edges.
The main governance change is permission persistence. Permission profiles now round-trip across TUI sessions, user turns, MCP sandbox state, shell escalation, and app-server APIs. That keeps the same trust decisions across more of the workflow instead of forcing teams to re-assert them in each surface.
Walkthrough
Start with the repo instructions. Codex’s docs use AGENTS.md for custom instructions, and the skills repo follows the same write-once pattern for repeatable task bundles. The first step is still to make the repo legible to the agent before asking it to change code.
A minimal AGENTS.md fragment can be enough to anchor behavior:
# AGENTS.md
- Read this file before editing code.
- Keep changes small and reviewable.
- Run the project’s test command before finishing.
- If a task touches permissions, explain the scope in the final note.
Use the release’s JSON and tracing improvements to keep the loop inspectable. The changelog says codex exec --json now reports reasoning-token usage, and rollout tracing records tool, code-mode, session, and multi-agent relationships. That helps when you need to compare runs, spot expensive prompts, or explain why one agent branch behaved differently from another.
A practical pattern is to keep machine-readable output in your automation layer and the human review in the diff:
codex exec --json -- <your-task>
Then inspect the output for token usage, tool activity, and any session metadata your wrapper stores. If you are orchestrating multiple agents, keep one branch focused on implementation and another on verification so the resulting diff stays readable.
The app-server changes matter when Codex is connected to external tools or remote state. The release adds Unix socket transport, pagination-friendly resume/fork, sticky environments, and remote thread config/store plumbing. In plain terms, it is easier to keep a long-running workflow attached to the same environment and to resume or fork work without losing context.
That is also where MCP fits. MCP is the standard way to connect AI applications to external systems, and Codex’s docs place MCP alongside tools, sandboxing, and orchestration. If your workflow already uses remote services or local connectors, treat the new app-server plumbing as a reliability upgrade. The goal is still the same: keep the agent’s scope explicit and the external surface controlled.
For teams that package repeatable tasks, the skills repo is the clearest adjacent pattern. Skills are folders of instructions, scripts, and resources that agents can discover and use repeatedly. If your team has a common review checklist, migration routine, or test harness, package it as a skill rather than copying prompts into every run. The release’s improved permission persistence and app-server behavior make that kind of reusable workflow easier to keep stable.
A simple operating loop looks like this:
- Put repo-specific rules in AGENTS.md.
- Package repeated tasks as skills.
- Run Codex in a bounded branch or worktree.
- Verify with tests or a targeted check.
- Review the diff, not just the agent transcript.
The changelog’s bug fixes reinforce that habit. Interrupting /review and exiting the TUI should no longer wedge delegate startup or unsubscribe. Exec-server no longer drops buffered output after process exit and waits correctly for stream closure. WebSocket clients are less likely to disconnect during bursts of turn and tool-output notifications. These are the kinds of fixes that matter when you are trying to make the loop dependable enough for daily use.
A small methodology note: this release is mostly a Review story. The useful habit is not to trust the agent more, but to make the output easier to inspect, resume, and compare. That is the same reason we keep a methodology step for review gates before merge.
Tradeoffs and limits
The release improves orchestration, but it does not remove the need for boundaries. Permission profiles that persist across surfaces are helpful only if your team has a clear policy for when escalation is allowed and when it is not. If those rules are vague, persistence can preserve the wrong decision just as reliably as the right one.
Multi-agent support also adds coordination overhead. The changelog mentions tracing for multi-agent relationships and config/schema handling around thread limits. That is a sign that teams should be careful about fan-out: more agents can mean more parallelism, but also more state to inspect and more ways to create noisy diffs.
There are also product edges to watch. The release notes mention fixes for conflicting MultiAgentV2 thread limits, relative agent-role config paths, unsupported MCP bearer-token fields, and invalid js_repl image MIME types. Those are reminders that config hygiene still matters. If your setup is loose, the agent will surface the mismatch later, usually in the middle of a task.
Finally, app-server and remote-plugin features are only useful if you can explain them to the team. If a workflow depends on sticky environments or remote thread stores, document the expected lifecycle and failure recovery path. Otherwise, the workflow becomes harder to reproduce than the code change it was meant to simplify.
Further reading
Related training topics
Related research

Codex GPT-5.5 and browser checks
GPT-5.5, in-app browser use, and auto-reviewed approvals in Codex. What changed and how to tighten the loop.

Codex CLI 0.124.0: tighter loops
Codex CLI 0.124.0 tightens AGENTS.md, hooks, MCP, Bedrock, and reviewable CLI workflows.

Codex CLI 0.128.0: workflows that hold up
Codex CLI 0.128.0 tightens AGENTS.md, resume loops, permissions, plugins, and reviewable diffs.