Train Your Team on Codex Remote
A practical Codex training guide for teams adopting Codex Remote, AGENTS.md, MCP, and reviewable CLI workflows.

Codex Remote reaching general availability is a good moment to train your team on remote-first Codex CLI workflows, not just ad hoc prompts. Start with one repo, one AGENTS.md, and one verification loop that produces reviewable diffs.
OpenAI Codex, OpenAI's coding agent, is most useful when the team agrees on how it should read context, change code, verify work, and hand back a pull request. A Codex training plan is a short, repeatable way to teach those habits in a real production codebase.
As of June 25, 2026, the official Codex changelog says Codex Remote has reached general availability. Treat that as a workflow milestone: the governance now belongs in the repo, not in everyone’s private prompt history.
Pick one production workflow to teach first
Run the first session around a real workflow your team already reviews every week. Good candidates are small bug fixes, dependency updates, flaky test cleanup, API client changes, or UI copy changes with snapshot tests.
The learning outcomes should be narrow and visible. By the end, each engineer should be able to start a Codex CLI task, point it at repo instructions, inspect the diff, run the agreed verification command, and decide whether the result is ready for review.
For sizing, keep the first Codex CLI training session to 4–8 engineers and 90 minutes. Larger groups tend to drift into demos; smaller groups can actually compare prompts, diffs, and failure modes.
The trap is teaching on a toy repo. Toy repos make the agent look smooth, but they skip the parts that matter: weird test data, local conventions, permissions, generated files, and reviewers who care about the diff.
For more workflow examples in this cluster, keep the related training topic close by while you build your internal workshop.
Put repo rules in AGENTS.md before prompt craft
Write the first version of AGENTS.md before you teach clever prompting. Codex agents need durable repo rules more than they need one-off instructions from the loudest person in the room.
A useful AGENTS.md says what code style to follow, which commands prove the work, which files are generated, and which areas need extra care. Keep it short enough that an engineer will actually maintain it.
Example for a service repo:
# AGENTS.md
- Prefer the smallest change that fixes the issue.
- Do not edit generated files under src/generated. Run pnpm generate instead.
- For API changes, update tests in tests/api and run pnpm test:api.
- For database changes, include the migration and the rollback note.
- Before handing back work, run pnpm lint and the narrowest relevant test.
The reason this matters is simple: Codex can only follow team conventions if the team has written them down somewhere stable. Nested AGENTS.md files are especially useful when a monorepo has different rules for frontend, backend, infra, and docs.
The trap is turning AGENTS.md into a policy encyclopedia. Long files get stale, and stale rules are worse than missing rules because they teach people not to trust the instructions.
If you are rolling this out across several repositories, pair this article with AGENTS.md for Codex Teams and keep one owner per repo instruction file.
Practice the Codex CLI verification loop
Make verification a muscle, not a cleanup step. A good Codex CLI workflow has three beats: describe the task, let Codex make a focused change, then require evidence before the diff goes to review.
A simple loop looks like this:
codex
# Ask for one bounded change.
# Review the plan before edits.
pnpm lint
pnpm test -- --runInBand tests/api/user-settings.test.ts
git diff --stat
git diff
The measurable after-state is not that everyone feels faster. It is that a team lead can open a Codex-assisted pull request and see a small diff, a clear test note, and no surprise edits outside the task boundary.
The trap is asking Codex to do broad work and then reviewing vibes. Broad tasks create broad diffs, and broad diffs make human review worse.
For remote workflows, add one more habit: make the verification command part of the handoff. If Codex could not run a command, the PR note should say that plainly so the next human knows what still needs checking.
Set MCP and skills boundaries early
MCP, the Model Context Protocol, is a standard way for coding agents to connect to external systems such as issue trackers, repositories, databases, and internal knowledge bases. In Codex training, MCP should be taught as an access boundary, not as a magic context hose.
Start with one useful integration. For example, let Codex read GitHub issues or a design doc, but do not grant write access to production systems during the first rollout.
A practical boundary note can live in AGENTS.md or your team rollout doc:
MCP access boundary
- Codex may read linked GitHub issues and repo documentation.
- Codex must not write to Jira, Slack, production databases, or deployment systems.
- If external context changes the implementation plan, summarize the source and the decision in the PR note.
Skills fit a different need. Use a skill when the team has a repeatable procedure, such as adding a new API endpoint, running a release checklist, or updating a design token package.
The trap is connecting everything on day one. More access creates more audit surface, and it can make the agent look confident about context that reviewers have not seen.
Copy this Codex team rollout plan
Paste this into your internal docs and edit it for one repo. Keep the first rollout boring on purpose.
Codex team rollout plan
Team and scope
- Repo:
- Team owner:
- Participants: 4-8 engineers
- Duration: 90 minutes
- First workflow: one small production change with tests
Learning outcomes
- Start a bounded Codex CLI task from the repo root.
- Use AGENTS.md as the durable instruction layer.
- Keep the diff small enough for normal review.
- Run the agreed verification command or document why it could not run.
- Write a PR note with changes, tests, and remaining risk.
Session flow
- 10 min: Pick one real issue and read the relevant AGENTS.md.
- 15 min: Ask Codex for a plan before edits.
- 25 min: Let Codex make the change, then inspect the diff.
- 20 min: Run lint, tests, or the narrowest useful check.
- 10 min: Write the PR note.
- 10 min: Compare what should be added to AGENTS.md.
Repo instruction starter
- Do not edit generated files directly.
- Prefer narrow tests over full-suite runs unless the change is cross-cutting.
- Ask before changing public APIs, migrations, auth, billing, or deployment files.
- Include verification output in the handoff.
Review checklist
- Is the task boundary clear?
- Does the diff avoid unrelated cleanup?
- Did Codex follow AGENTS.md?
- Are tests or checks named explicitly?
- Is any skipped verification called out?
- Would a normal reviewer approve this without extra archaeology?
Measurable after-state
- At least two engineers can repeat the workflow without facilitation.
- The repo has an AGENTS.md owner.
- The team has one accepted PR that includes a Codex-generated or Codex-assisted diff plus a human verification note.
The tradeoff is that this plan will feel small. That is the point: teams learn faster from one safe, reviewable path than from a giant menu of agent capabilities.
Common questions
-
How should we run codex training for an engineering team?
Run codex training as a hands-on repo session, not a slide deck. Keep it to 4–8 engineers for about 90 minutes, use one production workflow, and require every participant to inspect a diff, run or name verification, and update the team’s AGENTS.md notes afterward.
-
Do we need AGENTS.md before using Codex Remote?
You can use Codex Remote without a perfect AGENTS.md, but you should not scale team usage without one. Start with 5–10 rules covering generated files, test commands, sensitive areas, and PR handoff expectations, then let reviewers add missing conventions as they find them.
-
Where does MCP fit in Codex CLI workflows?
MCP belongs where Codex needs external context to do the task correctly. Start with read-only access to one system, such as GitHub issues or internal docs, and document the boundary clearly; write access should wait until your team has audit habits and rollback expectations.
-
What should a good Codex-assisted diff include?
A good Codex-assisted diff should be small, relevant, and easy for a human to review. The PR note should name the requested change, the files touched, the verification command, and any checks Codex or the engineer could not run.
-
Does Codex Remote GA change our governance model?
Yes, it should move governance from individual habits into shared repo artifacts. As of June 25, 2026, Codex Remote is generally available, so teams should treat AGENTS.md, MCP boundaries, verification loops, and review checklists as part of the operating model.
Further reading
- OpenAI Developers — Codex quickstart
- OpenAI Developers — Codex CLI
- OpenAI Developers — Codex CLI features
- OpenAI Developers — Codex slash commands
- Model Context Protocol — specification
- GitHub — openai/codex
- GitHub — openai/skills
- OpenAI Developers — Codex changelog
Start with one boring win
Pick one repo, write the first AGENTS.md, and run the 90-minute workflow with a real issue this week. If the resulting diff is reviewable and the verification note is clear, you have the base pattern for the next team.
One methodology lens
One useful way to read this through our methodology is the Plan step: delegate first-pass decomposition and dependency mapping, review the sequencing and assumptions, and keep ownership of scope and priorities. If that split is still fuzzy, the workflow usually is too.
Related training topics
Related research

Codex CLI 0.123.0: workflows that hold up
Codex CLI 0.123.0 workflows that hold up in review: replay recipes in the diff, a pinned model, a connector roster, and a ten-line done checklist.

Codex CLI 0.124.0: tighter rollback loops
Codex CLI 0.124.0 as a workflow moment: shrink the rollback contract, pin the model, and keep a connector roster and done checklist where reviewers live.

Codex 5.5: pin the model before you swap it
Codex 5.5 questions are model governance questions: pin the default model and escalation rule in AGENTS.md, and keep browser checks bridged to CLI receipts.