Read-only by default: safety and containment

ContextRelay runs two capable coding agents in one session. That power only stays safe if the defaults are conservative and every step toward more autonomy is explicit and reversible. ContextRelay is therefore read-only by default and fails closed: any missing precondition denies the action rather than allowing it.

This page explains the why (the safety philosophy) and the how (the exact commands, flags, config keys, and environment variables that control it). These are security claims, so everything here is grounded in the implementation - see Trust boundaries and threat model for the full boundary analysis.

What "read-only by default" means

Out of the box, ContextRelay coordinates and records. It does not let either agent make autonomous edits, and it does not spawn extra worker agents on its own. The two paths that could relax this - backup-agent autonomy and autonomous edits (act:write) - are both off by default and independent of each other. Turning one on never turns the other on.

The layered, fail-closed model

There are three distinct safety layers. Each is opt-in, and each fails closed on its own:

Permissions - a capability model that mediates what an agent may do at all (read, write, shell, git, and more). Fully permissive by default for a single trusted operator, but tightenable to read-only.
Backup-agent autonomy - whether agents may explicitly request read-only helper agents. Off by default.
act:write (autonomous edits) - whether an idle, contained worker may edit files. Off by default behind a chain of independent gates, several of which a config file can never satisfy.

The ordering matters: act:write builds on top of backup-agent autonomy. The global autonomy.enabled master switch must be on, and then act:write's own gate chain must also pass. Turning autonomy off (contextrelay autonomy off) disables act:write even when its two fields are set. There is no single switch that unlocks autonomous writes, and no config value can substitute for the structural containment described below.

Layer 1 - the permission capability model

ContextRelay mediates agent actions through eight capabilities plus a readonly flag. The policy lives in .contextrelay/config.json under permissions:

{
  "permissions": {
    "readonly": false,
    "allowed": [
      "read",
      "write",
      "shell",
      "network",
      "git",
      "secrets",
      "browser",
      "external_api"
    ],
    "agentOverrides": {}
  }
}

The eight capabilities are exactly: read, write, shell, network, git, secrets, browser, external_api.

When readonly is true, every capability except read is denied. When a capability is not in allowed, that operation is blocked and the daemon returns a reason to the agent (for example, git is blocked by ContextRelay readonly mode). You can also scope policy per agent via agentOverrides, so one agent can be more restricted than the other.
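The evaluation rules above can be modelled in a few lines. This is a minimal sketch in Python, not ContextRelay's implementation: the function name is_allowed is hypothetical, and it assumes that an agentOverrides entry has the same shape as the top-level policy and replaces whichever of readonly and allowed it sets. The source does not spell out the override merge rules or the reason text for a capability that is simply missing from allowed, so both are illustrative.

```python
CAPABILITIES = ("read", "write", "shell", "network", "git",
                "secrets", "browser", "external_api")


def is_allowed(permissions, agent, capability):
    """Return (allowed, reason) for one agent and one capability.

    Assumption: a per-agent override replaces the top-level `readonly`
    and `allowed` fields that it sets; the real merge rules may differ.
    """
    if capability not in CAPABILITIES:
        # Fail closed: an unknown capability is never granted.
        return False, f"{capability} is not a known capability"

    override = permissions.get("agentOverrides", {}).get(agent, {})
    readonly = override.get("readonly", permissions.get("readonly", False))
    # In this sketch a missing `allowed` list grants nothing.
    allowed = override.get("allowed", permissions.get("allowed", []))

    if readonly and capability != "read":
        return False, f"{capability} is blocked by ContextRelay readonly mode"
    if capability not in allowed:
        # Illustrative wording; the daemon's actual reason may differ.
        return False, f"{capability} is not in the allowed list"
    return True, "ok"


policy = {
    "readonly": False,
    "allowed": list(CAPABILITIES),
    "agentOverrides": {"codex": {"readonly": True}},
}
print(is_allowed(policy, "claude", "git"))
print(is_allowed(policy, "codex", "git"))
print(is_allowed(policy, "codex", "read"))
```

With this policy, claude keeps every capability while codex is reduced to read, which mirrors what permissions readonly on --agent codex is described as doing.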

Inspect and change the policy with the permissions command:

# Show the current mediated permission policy
contextrelay permissions status

# Make the whole project read-only (denies write/shell/network/git/secrets/browser/external_api)
contextrelay permissions readonly on

# Deny or allow a single capability
contextrelay permissions deny git
contextrelay permissions allow git

# Restrict just one agent (per-agent override)
contextrelay permissions readonly on --agent codex

# Reset back to defaults (clears overrides, re-allows all capabilities)
contextrelay permissions reset

The default policy is permissive on purpose

The threat model assumes a single trusted operator on one workstation - not mutually untrusted tools. So the default permissions.allowed list grants every capability. "Read-only by default" refers to the two autonomy layers below, which are off regardless of how permissive your permission policy is. Use permissions readonly on when you want to additionally constrain what the agents may do.

Layer 2 - backup-agent autonomy (read-only helpers)

Autonomy controls whether agents may explicitly request read-only backup agents - headless helper workers for a second analysis pass. It is off by default (contextrelay autonomy on enables it), and nothing dispatches a backup agent on its own: an agent has to ask, using the ask_codex_backup / ask_claude_backup MCP tools.

The security substance is in how the workers are spawned: a Claude backup runs with --allowedTools Read,Grep,Glob,LS (read and search only, no edit or shell), a Codex backup runs with --sandbox read-only, and both inherit CONTEXTRELAY_WORKER=1 so a worker can never recursively start another ContextRelay session. Results are recorded as idle_action_result artifacts in the ledger.
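As a rough illustration of the spawn rules above, the sketch below keeps the two read-only flag sets in one place and shows how an inherited marker can act as a recursion guard. Only the flags and the CONTEXTRELAY_WORKER variable come from the text; the function names are hypothetical, and the rest of the real command lines (executables, prompts, other flags) is deliberately left out.

```python
# Flags quoted from the text above, keyed by backup worker type.
READ_ONLY_FLAGS = {
    "claude": ["--allowedTools", "Read,Grep,Glob,LS"],
    "codex": ["--sandbox", "read-only"],
}


def backup_worker_env(base_env):
    """Copy the parent environment and add the worker marker."""
    env = dict(base_env)
    env["CONTEXTRELAY_WORKER"] = "1"
    return env


def may_start_session(env):
    """Recursion guard: a process marked as a worker must not start
    another ContextRelay session."""
    return env.get("CONTEXTRELAY_WORKER") != "1"


parent_env = {"PATH": "/usr/bin"}  # stand-in for os.environ
worker_env = backup_worker_env(parent_env)

print(READ_ONLY_FLAGS["claude"])
print(may_start_session(parent_env), may_start_session(worker_env))
```

Because the marker travels through the environment, any process the worker launches inherits it as well, which is what makes the guard hold for nested children.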

When to turn this on, the trigger phrases, throttling, and the budgets that bound each run are covered in Autonomy, idle scanner, and safe automation.

Backup agents never write

Read-only backup agents exist to analyze, not to act. Their tool sets contain no edit, write, shell, or git capability, so they cannot modify your tree even if asked. Treat their output as evidence for you or the coordinating agent to act on - not as completed work.

Layer 3 - `act:write` (autonomous edits), and why it fails closed

act:write is the strongest capability ContextRelay offers and the one held to the strictest standard. It lets an idle worker actually edit files to address a detected opportunity (for example, a failed check with no follow-up). It is off by default and fully fail-closed.

act:write is configured by two fields under autonomy.writableAction and controlled with the act command:

# Show the act:write config surface (default: off)
contextrelay act status

# Arm act:write: enabled=true plus a positive daily budget (USD).
contextrelay act on --budget 1.00

The default config is the closed/safe value at every field:

{
  "autonomy": {
    "writableAction": {
      "enabled": false,
      "budgetUsd": 0
    }
  }
}
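To make "armed" concrete, here is a minimal Python sketch of the config-level precondition, assuming the config has already been parsed into a dictionary. The function name act_write_armed is hypothetical. It only covers the public arming surface (the autonomy.enabled master switch plus the two writableAction fields); it says nothing about the internal gates that are evaluated afterwards.

```python
def act_write_armed(config):
    """True only if the master switch and both writableAction fields
    are set. Any missing or malformed key counts as the closed value."""
    autonomy = config.get("autonomy", {})
    writable = autonomy.get("writableAction", {})
    budget = writable.get("budgetUsd", 0)
    return (
        autonomy.get("enabled") is True
        and writable.get("enabled") is True
        and isinstance(budget, (int, float))
        and not isinstance(budget, bool)  # True/False are not budgets
        and budget > 0
    )


stock = {"autonomy": {"writableAction": {"enabled": False, "budgetUsd": 0}}}
no_master = {"autonomy": {"writableAction": {"enabled": True, "budgetUsd": 1.0}}}
armed = {"autonomy": {"enabled": True,
                      "writableAction": {"enabled": True, "budgetUsd": 1.0}}}

print(act_write_armed(stock))      # stock config stays closed
print(act_write_armed(no_master))  # fields set, master switch off
print(act_write_armed(armed))      # all three set
```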

The gate chain - ALL of these must pass

Before a single contained writable worker may run, three things must hold. First, the global master switch autonomy.enabled must be on. Second, the public surface must arm act:write: autonomy.writableAction.enabled is true and autonomy.writableAction.budgetUsd is greater than 0. Third, a fail-closed internal floor must pass, evaluated in order:

the opportunity is a dirty_tree_finalizable kind;
its owner is claude or codex;
strict dual-idle quiescence;
single-flight;
the daily budget check (spent + per-task estimate ≤ budgetUsd);
worktree containment.

The contained worker always runs inside Codex's workspace-write sandbox, which confines writes to the worktree and the system temp area, and act:write refuses to run, failing closed, if the primary repo is itself under a system-temp directory. The first failing gate denies the action with a specific reason. Because act:write sits behind autonomy.enabled, contextrelay autonomy off disables it outright.
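The ordered, first-failure-wins evaluation described above can be sketched as a list of gates walked in sequence. This is an illustrative Python model under stated assumptions, not the daemon's code: the Candidate fields, the reason strings, and the reading of "single-flight" as "no other contained worker is running" are all hypothetical.

```python
from dataclasses import dataclass, replace


@dataclass
class Candidate:
    """Facts about one opportunity. All field names are hypothetical."""
    armed: bool               # master switch on and writableAction armed
    kind: str
    owner: str
    both_agents_idle: bool
    write_in_flight: bool     # assumed meaning of "single-flight"
    spent_usd: float
    estimate_usd: float
    budget_usd: float
    worktree_contained: bool


# Ordered gates: (reason reported on failure, predicate).
GATES = [
    ("act:write is not armed", lambda c: c.armed),
    ("opportunity kind is not dirty_tree_finalizable",
     lambda c: c.kind == "dirty_tree_finalizable"),
    ("owner is not claude or codex", lambda c: c.owner in ("claude", "codex")),
    ("agents are not both idle", lambda c: c.both_agents_idle),
    ("another write worker is in flight", lambda c: not c.write_in_flight),
    ("daily budget would be exceeded",
     lambda c: c.spent_usd + c.estimate_usd <= c.budget_usd),
    ("worktree containment is unavailable", lambda c: c.worktree_contained),
]


def evaluate(candidate):
    """Return (allowed, reason). The first failing gate decides."""
    for reason, passes in GATES:
        try:
            ok = passes(candidate) is True
        except Exception:
            ok = False  # fail closed on any evaluation error
        if not ok:
            return False, reason
    return True, "all gates passed"


good = Candidate(True, "dirty_tree_finalizable", "codex",
                 True, False, 0.40, 0.25, 1.00, True)
print(evaluate(good))
print(evaluate(replace(good, spent_usd=0.90)))  # 0.90 + 0.25 > 1.00
print(evaluate(replace(good, armed=False, owner="someone-else")))
```

The last call fails two gates at once but reports only the first, which is what gives each denial a single specific reason.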

The gate-by-gate table with defaults, and the recommended enablement order, live in Enabling act:write safely.

Arming is two fields under a master switch; safety is structural containment

act:write is armed by config alone: autonomy.writableAction.enabled: true plus a positive budgetUsd, with the global autonomy.enabled master switch on. There is no separate environment variable to export. What keeps it safe is not a second activation factor but structural containment: the contained worker always runs inside Codex's workspace-write OS sandbox in an ephemeral worktree. That sandbox confines writes to the worktree and the system temp area. Your project, including its .contextrelay/config.json, lives outside both, so the worker cannot reach it. As a fail-closed backstop, act:write refuses to run (falling back to read-only) if the project itself is inside a system-temp directory (os.tmpdir(), $TMPDIR, /tmp, /private/tmp). A worker therefore cannot edit your primary tree or re-arm itself. (Live-verified on macOS; the Codex sandbox is expected to behave the same on other platforms but is not yet separately verified.)

In stock config (enabled: false, budgetUsd: 0), nothing passes. That is the headline default-off guarantee.
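The system-temp backstop matters because the sandbox allows writes to the temp area, so a project located there would be writable by the worker. A minimal Python sketch of such a check follows. It is an assumption-laden illustration: tempfile.gettempdir() stands in for the os.tmpdir() call named above, the function names are hypothetical, and the real check may resolve or compare paths differently.

```python
import os
import tempfile
from pathlib import Path


def system_temp_roots(environ=None):
    """Collect the temp roots named in the text, resolved through symlinks
    (on macOS, /tmp resolves to /private/tmp)."""
    environ = os.environ if environ is None else environ
    roots = {tempfile.gettempdir(), "/tmp", "/private/tmp"}
    if environ.get("TMPDIR"):
        roots.add(environ["TMPDIR"])
    return {Path(root).resolve() for root in roots}


def under_system_temp(project, environ=None):
    """True if the project is a temp root or lives anywhere below one."""
    project = Path(project).resolve()
    return any(project == root or root in project.parents
               for root in system_temp_roots(environ))


print(under_system_temp(os.path.join(tempfile.gettempdir(), "scratch-repo")))
print(under_system_temp("/srv/contextrelay-demo/project"))
```

Comparing resolved path components rather than string prefixes avoids treating a directory such as /tmpfoo as if it were inside /tmp.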

Containment - where the writes actually go

When (and only when) every gate passes, the contained worker does not touch your working tree: a single bounded worker runs inside an ephemeral git worktree on a throwaway contextrelay/write/ branch, leaves its edits uncommitted, and the daemon captures the diff before teardown and records it as an idle_write_result artifact. The worktree and its branch are then removed unconditionally, even on failure.
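The lifecycle above (create, run, capture, always tear down) maps naturally onto a try/finally. The Python sketch below is one way it could look, not the daemon's code: run_contained is a hypothetical name, the branch suffix in the example is made up, and the command runner is injectable so the flow can be exercised with a recording stand-in instead of a real repository.

```python
import subprocess


def run_contained(run_worker, worktree, branch, run=subprocess.run):
    """Run one worker in a throwaway worktree and return its uncommitted diff.

    Assumes the current directory is the primary repository.
    """
    run(["git", "worktree", "add", "-b", branch, worktree], check=True)
    try:
        run_worker(worktree)  # the worker edits files but never commits
        # Capture tracked changes before teardown. `git diff HEAD` does not
        # include untracked files; this sketch does not handle them.
        result = run(["git", "-C", worktree, "diff", "HEAD"],
                     check=True, capture_output=True, text=True)
        return result.stdout
    finally:
        # Unconditional teardown, also when the worker or the capture failed.
        run(["git", "worktree", "remove", "--force", worktree])
        run(["git", "branch", "-D", branch])


# Dry run with a recording stand-in for subprocess.run.
class FakeResult:
    stdout = "diff --git a/app.py b/app.py\n"


recorded = []


def fake_run(cmd, **kwargs):
    recorded.append(cmd)
    return FakeResult()


def failing_worker(path):
    raise RuntimeError("worker crashed")


try:
    run_contained(failing_worker, "/srv/demo-worktree",
                  "contextrelay/write/example", run=fake_run)  # made-up suffix
except RuntimeError as err:
    print("worker error surfaced:", err)

for cmd in recorded:
    print(" ".join(cmd))
```

In the dry run the worker fails, no diff is captured, and the two teardown commands still run, which is the "even on failure" property in miniature.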

So the worker never writes to your primary tree, never commits, never merges, and never pushes. What you get is a captured diff to review - evidence, not an applied change. The full containment guarantees, current capture limitations, and the opt-in procedure are in Enabling act:write safely.

act:write writers also carry a marker

A contained write worker advertises CONTEXTRELAY_WRITE_WORKER=1 (in addition to CONTEXTRELAY_WORKER=1), so a worker - or a skill it loads - can tell the contained-write mode apart from a read-only run.
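A worker or skill could read the two markers along these lines. This is a small Python sketch with a hypothetical function name and hypothetical mode labels; the choice to treat a write marker without the base worker marker as read-only is this sketch's own conservative assumption, since the text says a write worker always carries both.

```python
import os


def worker_mode(environ=None):
    """Classify the current process from the two marker variables."""
    environ = os.environ if environ is None else environ
    is_worker = environ.get("CONTEXTRELAY_WORKER") == "1"
    is_writer = environ.get("CONTEXTRELAY_WRITE_WORKER") == "1"
    if is_worker and is_writer:
        return "contained-write"
    if is_worker or is_writer:
        # Either a plain backup worker, or an inconsistent marker pair;
        # both are treated as the more restrictive mode.
        return "read-only"
    return "not-a-worker"


print(worker_mode({"CONTEXTRELAY_WORKER": "1"}))
print(worker_mode({"CONTEXTRELAY_WORKER": "1",
                   "CONTEXTRELAY_WRITE_WORKER": "1"}))
print(worker_mode({}))
```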

The human-authority boundary

The read-only-by-default stance has a matching rule for the agents themselves. Outside an explicitly dispatched contained act:write worker, agents must not:

edit files, or run git writes outside the configured coordinator/git policy;
spend beyond the configured opt-in budgets and the daily cap;
publish, release, or take outward/destructive actions;
kill or restart daemons.

These require human authority. Git writes in particular are owned by the configured coordinator - see Coordinator and git-write policy. Final sign-off on completed work is also human-gated by default - see Finality and human sign-off.

What ContextRelay does not claim

Per the threat model, ContextRelay does not sandbox Claude, Codex, backup agents, shell commands, or code under test, and it does not protect against another process running as the same OS user. Its containment guarantees are about ContextRelay's own autonomous paths (backup workers and act:write), not about isolating the agents from the workstation. The ledger is local operational state, not a tamper-proof audit trail.

Quick reference

Layer	Default	Turn on with	Hard requirement a config can't satisfy
Permissions (`permissions.readonly`)	permissive (all 8 capabilities)	`contextrelay permissions readonly on` to tighten	-
Backup autonomy (`autonomy.enabled`)	off	`contextrelay autonomy on`	-
`act:write` (`autonomy.writableAction.enabled`)	off	`contextrelay autonomy on`, then `contextrelay act on --budget <usd>` plus the internal floor	Codex `workspace-write` confines writes to the worktree + system temp; your project lives outside those, and act:write refuses if the project is under system temp - so a worker can't re-arm itself

Next steps

Coordinator and git-write policy - who owns commits, merges, and pushes.
Finality and human sign-off - how completed work is accepted.
Activation: auto-connect vs dormant-by-default - controlling whether ContextRelay engages at all.
Enabling act:write safely - the full, deliberate opt-in walkthrough.
Trust boundaries and threat model - the in-scope/out-of-scope security model behind these defaults.
Environment variables reference and config.json reference - every key and variable named above.
