Back to Research

Codex CLI 0.129.0: MCP and reviewable diffs

Codex CLI 0.129.0 workshop notes on MCP, AGENTS.md, verification loops, and reviewable diffs for engineering teams.

Editorial illustration for Codex CLI 0.129.0: MCP and reviewable diffs. I believed the shiny part was the point.
Rogier MullerMay 11, 20265 min read

The situation

Counter-thesis: the most useful Codex CLI release notes are about whether your team can keep a production loop reviewable.

I believed the shiny part was the point. I tried the new workflow, skimmed the changelog, and moved on. Then I watched teams adopt Codex CLI without updating repo instructions, verification habits, or the MCP boundary, and the result was predictable: faster edits, slower trust.

Diagnosis: this is context drift, the gap between a system’s speed and the team’s mental model. Donald Norman’s work on mental models is the useful lens here, and it maps cleanly to agentic coding.

The actual thesis: Codex CLI 0.129.0 matters most when you use it to tighten the loop around AGENTS.md, verification, and reviewable diffs.

For Codex users, that is the point of the changelog and the right frame for Codex CLI workflows.

Walkthrough

Resume and fork without losing the thread. If you have shipped AI code, you have hit this: a task pauses, then the next session starts with half the story missing. The 0.129.0 changelog calls out a redesigned resume/fork picker, raw scrollback mode, /ide context injection, and workspace-aware /diff. They all point at the same issue: the team cannot inspect what happened before it continues. The fix is the Resume Ledger: every handoff leaves behind a diff, a short note, and the exact workspace context that produced it.

# AGENTS.md
- Before resuming, read the last reviewable diff and the latest verification output.
- If the task forks, write the fork reason in one sentence.
- Never continue from hidden state alone; re-anchor in the workspace.
- Prefer workspace-aware /diff before asking for approval.

After that, resume stops being a guess. You get fewer “what changed?” conversations and more “does this diff pass?” conversations. That is tip one.

Modal editing is a workflow choice. If you live in the composer all day, Vim mode sounds cosmetic until command entry becomes the bottleneck. The release adds /vim, default-mode config, and Vim-specific keymap contexts, which makes the editor surface part of the workflow contract. The fix is the Composer Mode Contract: decide whether your team standardizes on modal editing, then document it in onboarding so the keymap is not a surprise.

That is the kind of thing worth covering in a Codex workshop or Codex CLI training: stable input models reduce friction, and the diff gets more attention than the editor. That is tip two.

Hooks and MCP need a boundary note, not optimism. The changelog adds hook browsing and toggling from /hooks, pre- and post-compaction hooks, and PreToolUse context, while also surfacing Codex Apps auth and eligible MCP elicitations through TUI/Guardian flows. That combination is useful and easy to overtrust. The fix is the MCP Boundary Note: write down which connectors are allowed, what they can see, and which actions require review before the model touches them.

# MCP boundary note
- Allowed: read-only issue lookup, repo metadata, approved docs search.
- Not allowed: secrets, production writes, or unreviewed admin actions.
- Any new connector must pass least-privilege review before enablement.
- Hooks may enrich context, but they do not widen permissions.

Once teams do this, MCP stops being a vague integration story and becomes a controlled extension point. That is tip three.

Plugin sharing is governance, not convenience. 0.129.0 expands workspace sharing, share access controls, source filtering, local share path tracking, marketplace removal and upgrades, remote bundle sync, and admin-disabled status handling. If you have shipped AI code, you have seen the failure mode: someone installs a plugin because it works, and nobody can explain who owns it or what changed. The fix is the Share-and-Source Rule: every shared plugin names its source, owner, and rollback path.

That changes the review conversation immediately. Instead of “can we use this?” the team asks “who can update this, and how do we remove it?” That is tip four.

Goals and validation need a pause state. The release says experimental goals are discoverable, stay paused across resume unless the user opts back in, and show clearer validation and multi-day duration output. That is the right shape for production work: the system should not silently re-enable intent after interruption. The fix is the Pause-Until-Intent Rule: if a goal or long-running task resumes, require an explicit re-acknowledgment before it continues.

In a Codex workshop, I would pair that with a short review checklist:

  • Did the task resume from visible state?
  • Did verification run?
  • Is the diff still the same?
  • Does AGENTS.md still match the workflow?

That is tip five.

Synthesis: if the diff is not reviewable, the workflow is not finished. I repeat that because it is the thesis in one sentence, and it is the standard I want teams to carry into every Codex CLI loop.

Tradeoffs and limits

Codex CLI 0.129.0 does not remove the need for human judgment. Modal editing can speed one person up and confuse another; MCP can widen capability and widen risk; plugin sharing can improve reuse and spread bad defaults.

The limit is the same across all of it: the tool can accelerate a loop, but it cannot prove the loop is safe. That still belongs to the team, and that is why the thesis phrase matters: keep the loop reviewable.

A practical methodology note: in the Review step, treat every Codex-generated change as incomplete until the diff, the verification output, and the instruction file all agree.

Further reading

Where to go next

If you are turning this into a team standard, start with the repo’s AGENTS.md and then map it to your Codex CLI workflow. For workshop framing, use /topics/cli-workflows as the shared entry point.

Related training topics

Related research

Continue through the research archive

Ready to start?

Transform how your team builds software.

Get in touch