Codex-auto-review: what it catches and misses
Codex-auto-review trials showed Codex catching syntax drift and missing permission drift. The fix is transcript evidence and repo contracts, not more autonomy.

Auto-review catches syntax drift and misses permission drift. We turned it on, the merge queue sped up, and the audit a week later showed exactly that split. Codex-auto-review is the practice of letting Codex, OpenAI's coding agent, review diffs automatically while a human keeps the merge decision. It is worth running, but the green checkmark guarantees less than it looks like, and the gap is always around scopes and permissions.
So treat auto-review as a filter, not a verdict. It clears the boring stuff so your eyes land on the parts that actually need a human: who can touch what, and why.
Trust the transcript, not the pass rate
The first sprint, we made a quiet mistake. We audited only the diffs auto-review flagged and never the ones it waved through. The waved-through pile is where every permission problem was hiding.
That is survivorship bias in plain clothes. The diffs the tool catches leave a report behind. The ones it misses leave nothing, so the evidence pile flatters the tool by design. A clean dashboard told us things were fine while a connector quietly gained scopes it never should have had.
The fix is cheap. Ask every run to end with a transcript snippet a reviewer can replay: the actual commands, the actual output. Confidence should track that snippet, not a screenshot and not the pass count.
Four gaps to guard before you widen autonomy
These are the four places auto-review gets quiet when you most want it loud. Each one has a small written contract that closes it.
Verification bypass. Exec shortcuts skip tests, a regression slips back in, and the automated pass makes the shortcut feel safe. Require a transcript snippet showing tests ran after the code was generated. Now a green merge means the ritual happened, not that someone felt good about it.
Headless versus browser drift. Chrome workflows diverge from CLI habits, so the reviewer sees two versions of the truth and the demo contradicts CI. Write the staging URLs and credential boundaries down next to any browser task.
Model mismatch. People swap models casually, and different models carry different risk appetite. Pin the default model and the escalation rule inside AGENTS.md so a lead can still reason about the blast radius of a change.
MCP privilege creep. Connectors accumulate, each server widens what an agent can reach, and this is the permission drift the trial actually exposed. Keep a Markdown roster of connectors checked into the repo root, grounded in the MCP specification.
Here is the snippet we keep at the bottom of AGENTS.md. Paste it, adjust the wording, ship it.
# AGENTS.md verification snippet
- Every Codex CLI run ends with the transcript snippet reviewers can replay.
- Pair browser evidence with the project's normal CLI checks before merge.
- If MCP servers are enabled, list allowed actions beside each connector name.
Keep four gates on every merge
When a model is the reviewer, the human questions do not change, they just get more deliberate. These four cover most of what auto-review cannot prove on its own.
| Gate | Question |
|---|---|
| Replay proof | Which commands prove regression guards? |
| Receipt match | Does the PR body list scopes plus a verification transcript? |
| Rules precedence | Which AGENTS.md or SKILL.md rule governed behavior? |
| Connector truth | Which MCP servers fired, and were they expected? |
And the short scope receipt a person still signs off on:
- Red-folder paths received explicit human acknowledgement.
- Scopes in the PR body match the folders in the diff.
- Primary-doc links were smoke-checked after publishing edits.
- MCP connectors mentioned (if any) list owners.
If you want the tool's side of the loop, the quickstart, the Codex CLI docs, the features page, and the slash commands reference cover it, and openai/skills shows where packaged behaviors are heading. The architecture judgement stays with the humans either way. For the full practice area, see Codex CLI workflows.
Common questions
-
What is codex-auto-review?
Codex-auto-review is the practice of letting Codex review diffs automatically while a human keeps the merge decision. In our trials it caught syntax drift but missed permission drift. That is why every automated pass still needs a transcript snippet showing tests ran after the code was generated, pasted somewhere a reviewer can replay it.
-
Which model should codex-auto-review runs use?
Pin one default model and one escalation rule inside
AGENTS.mdinstead of letting people swap casually. Different models carry different risk appetite, so an unpinned model makes review expectations wobble. With it pinned, a lead can reason about the blast radius of a given merge instead of guessing. -
Does auto-review replace human code review?
No. Auto-review speeds up execution while architecture judgement and ownership stay with people. The gates keep that line concrete: a reviewer still asks which commands prove the regression guards and whether the PR body lists scopes alongside the verification transcript. The model filters, the person decides.
-
What evidence should a Codex auto-review run leave behind?
A transcript snippet showing tests ran after the code was generated, pasted where reviewers can replay it. The scope receipt adds three checks on top: red-folder paths get explicit human acknowledgement, PR-body scopes match the folders in the diff, and any MCP connectors mentioned list their owners.
Start with the connector roster
If you only do one thing this week, write the connector roster, because permission drift is the failure auto-review demonstrably missed. To build the habit on your team, see how we run this in training.
Related training topics
Related research

Codex mobile CLI docs your team can read anywhere
The codex mobile cli question is a docs question: how a team keeps AGENTS.md rules, run notes, and verification transcripts readable away from the desk.

Codex CLI workspace tools that make runs reviewable
Codex CLI workspace tools that carry the operating contract: model pin, connector roster, done checklist, and slash catalog for reviewable agent runs.

Codex mobile CLI: runs that survive review
A codex mobile cli pattern for running Codex CLI from anywhere: verification latches, model pins, and connector rosters that keep remote runs reviewable.
Continue through the research archive
Newer research
Fast mode is not the default: when fast models earn it
The fast model is a tradeoff you make on purpose: scope ledgers, replay sandwiches, and connector cards that keep fast agent runs reviewable.
Earlier research
Is Codex CLI giving MCP more reach than reviewers expect?
An operational memo for Codex CLI 0.129.0: MCP boundaries, connector owners, allowed actions, AGENTS.md instructions, and reviewable diffs.