/pr-loop
The complete PR loop — branch, implement, test, commit, push, PR, WAIT FOR REVIEW, fix, merge, cleanup. Includes PR creation and review comment fetching. Use whenever creating a PR or finishing work. This is NOT optional. Every change goes through this loop. No exceptions.
$ golems-cli skills install pr-loop
The full loop. Not "create PR." Not "push and move on." The FULL loop through MERGED.
The Iron Law
MISSION = MERGED
Not "tests pass." Not "PR created." Not "pushed."
Done = PR merged + branch deleted + main pulled.
The Full Loop
1. BRANCH git checkout main && git pull && git checkout -b feat/name
2. IMPLEMENT Write code (invoke /superpowers:test-driven-development)
3. TEST Run full test suite — ALL must pass
4. VERIFY Invoke /superpowers:verification-before-completion
5. COMMIT git add <specific files> && git commit (invoke /commit)
6. PUSH git push -u origin feat/name
7. PR Create PR (see "Creating the PR" below)
8. REVIEW Fetch + read review comments (see "Reading Reviews" below)
9. FIX Address real bugs from review
10. MERGE gh pr merge <N> --squash --delete-branch
11. CLEANUP git checkout main && git pull
Step 7: Creating the PR
Prerequisites
- gh CLI installed (brew install gh)
- Authenticated: gh auth login
- On a feature/fix branch (not main/master/dev)
- All changes committed
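The prerequisites above can be sketched as a preflight check. This is illustrative, not part of the skill itself; `pr_preflight` is a hypothetical helper, and the protected branch names are assumptions to adjust per repo.

```shell
# Preflight sketch: refuse to proceed from a protected branch.
pr_preflight() {
  case "$1" in
    main|master|dev)
      echo "blocked: switch to a feature branch first"
      return 1
      ;;
  esac
  echo "ok: $1"
}

# Full real-world check (requires gh and a git repo):
#   command -v gh >/dev/null || { echo "install gh first"; exit 1; }
#   gh auth status >/dev/null 2>&1 || { echo "run gh auth login"; exit 1; }
#   [ -z "$(git status --porcelain)" ] || { echo "commit your changes first"; exit 1; }
pr_preflight "feat/my-feature"   # prints "ok: feat/my-feature"
```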
Create the PR
# Push branch first
git push -u origin HEAD
# Create PR with structured body
gh pr create --title "feat: description" --body "$(cat <<'EOF'
## Summary
- What changed and why
## Test plan
- [ ] Tests pass
- [ ] Manual verification done
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
EOF
)"

Edge Cases
- On main/dev/master: Don't create PR. Switch to a branch first.
- Uncommitted changes: Commit first.
- PR already exists: Use gh pr view to check, don't create duplicate.
- Custom base branch: gh pr create --base dev
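The "PR already exists" case can be made mechanical. This sketch assumes `gh pr view --json number --jq .number` prints the current branch's PR number (empty or an error when none exists); `decide_pr_action` is a hypothetical helper name.

```shell
# Decide whether to create a PR, given the existing-PR lookup result.
decide_pr_action() {
  if [ -n "$1" ]; then
    echo "view #$1"    # a PR already exists: inspect it, don't create a duplicate
  else
    echo "create"      # no PR yet: safe to run gh pr create
  fi
}

# Real usage (gh must be authenticated):
#   decide_pr_action "$(gh pr view --json number --jq .number 2>/dev/null || true)"
decide_pr_action ""     # prints "create"
decide_pr_action "42"   # prints "view #42"
```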
Step 8: REVIEW (The Critical Step)
This is NOT optional. This is NOT "auto-merge."
NEVER Merge With 0 Reviews
# Check BEFORE merging — empty reviewDecision + <2 comments = nobody looked
gh pr view <N> --json reviewDecision,comments
# CLEAN status with no reviews ≠ approved. It means NOBODY LOOKED.
# Wait minimum 10-15 min after requesting reviews before considering merge.

| Bot | Expected time | Notes |
|---|---|---|
| CodeRabbit | 2-5 min | Auto-reviews on push |
| Greptile | Needs OSS approval | Manual activation |
| Macroscope | Needs activation | Auto-reviews once installed |
If reviewDecision is empty and comments < 2 → DO NOT MERGE.
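The merge gate above can be encoded directly. `can_merge` is a hypothetical guard: it takes the review decision and the comment count, which in a real run would come from `gh pr view` with `--jq` filters.

```shell
# Merge guard sketch: empty reviewDecision + fewer than 2 comments = nobody looked.
can_merge() {
  local decision="$1" comment_count="$2"
  if [ -z "$decision" ] && [ "$comment_count" -lt 2 ]; then
    echo "DO NOT MERGE: no decision, only $comment_count comment(s)"
    return 1
  fi
  echo "ok to consider merge (decision=${decision:-none}, comments=$comment_count)"
}

# Real usage:
#   can_merge "$(gh pr view <N> --json reviewDecision --jq '.reviewDecision // \"\"')" \
#             "$(gh pr view <N> --json comments --jq '.comments | length')"
can_merge "" 0 || true    # blocked
can_merge "APPROVED" 5    # allowed
```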
Step 8a: Invoke Reviewers
Always explicitly request reviews. Don't wait for auto-detection.
# Invoke all available reviewers (do this right after PR creation)
gh pr comment <N> --body "@coderabbitai review"
gh pr comment <N> --body "@greptileai review"
gh pr comment <N> --body "@codex review"
gh pr comment <N> --body "@cursor @bugbot review"

For private repos (no bot reviewers):
# Option A: Use coderabbit:code-reviewer subagent
Agent(subagent_type="coderabbit:code-reviewer", prompt="Review PR #N")
# Option B: Use cr CLI
cr review --plain

For public repos (bot reviewers configured):
# Option A (preferred): Use /loop to poll for review comments
# /loop 2m gh pr view <N> --comments | tail -20
# Option B: Use CronCreate to schedule review checks
# CronCreate(schedule="*/2 * * * *", command="gh pr view <N> --comments | tail -20")
# Option C (manual): Wait and check once
sleep 90
gh pr view <N> --comments

Reading Review Comments
Fetch comments from all review sources with full context:
# Quick view of all comments
gh pr view <N> --comments
# Detailed: get review comments with diff context
gh api repos/{owner}/{repo}/pulls/{N}/comments

Review sources (coverage stack):
| Source | Type | How to Trigger / Check |
|---|---|---|
| CodeRabbit | AI review + auto-summaries | Auto on PR. Also: CodeRabbit plugin or cr review --plain |
| Codex Cloud | AI code review (GPT-5.2-codex) | gh pr comment <N> --body "@codex review" or comment manually on GitHub. Auto-reviews if enabled in Codex settings. Reads AGENTS.md "Review guidelines". Flags P0/P1 by default. |
| Cursor Bugbot | Bug detection | gh pr comment <N> --body "@cursor @bugbot review" or comment manually on GitHub. For re-review after fixes: gh pr comment <N> --body "@cursor @bugbot re-review". Bot responds as cursor[bot]. |
| Greptile | AI review + codebase understanding | Comment @greptileai review. Needs OSS activation. |
| DeepSource | Static analysis | Check via CI status |
After fixing review feedback, trigger re-review on every reviewer:
gh pr comment <N> --body "@coderabbitai review"
gh pr comment <N> --body "@codex review"
gh pr comment <N> --body "@cursor @bugbot re-review"

Codex Cloud is enabled on: EtanHey/voicelayer, EtanHey/orchestrator, EtanHey/golems, EtanHey/brainlayer.
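Re-requesting review is the same comment loop for every bot, so it can be wrapped once. `request_rereviews` is a hypothetical helper; it echoes the commands as a dry run here, so swap `echo` for the real `gh pr comment` call.

```shell
# Dry-run sketch: one re-review trigger per reviewer bot.
request_rereviews() {
  local n="$1"; shift
  for trigger in "$@"; do
    # Real call: gh pr comment "$n" --body "$trigger"
    echo "gh pr comment $n --body '$trigger'"
  done
}

request_rereviews 123 "@coderabbitai review" "@codex review" "@cursor @bugbot re-review"
```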
Investigate Before Dismissing (CRITICAL)
Default stance = "let me investigate" — NOT "this is intentional."
The worst PR loop failure mode: auto-dismissing a reviewer suggestion with "intentional per design doc" without checking if the reviewer found a real gap the design doc missed.
WRONG:
CodeRabbit: "Missing orphan reparenting when a node is deleted"
You: "@coderabbitai This is intentional per phase5-v2-synthesis.md. Please learn this."
Later: Realize CodeRabbit was right. Design was wrong. You taught it a bad Learning.
RIGHT:
CodeRabbit: "Missing orphan reparenting when a node is deleted"
You: Read the design doc. Does it explicitly address THIS tradeoff?
→ Yes, with clear reasoning → Push back with the specific passage.
→ No, or vague → Treat as potential gap. Investigate before closing.
The investigation protocol for any "conflicts with design" suggestion:
- Read the actual design doc section referenced — don't rely on memory
- Ask: does this doc EXPLICITLY address the tradeoff the reviewer raised?
- If yes, with clear reasoning → reply with the specific passage
- If no, or only implicitly → investigate the reviewer's concern as a real gap
- If the design doc is WRONG → update the design doc AND correct any bad Learnings already taught
Teaching a reviewer a bad Learning is worse than not teaching it anything. A false Learning suppresses future flags on a real bug category. Audit any Learning you've set if the underlying assumption turned out to be wrong:
# CodeRabbit: flag a Learning for correction
@coderabbitai I need to correct a previous learning. [Pattern X] does actually require
[handling Y] — our earlier design was incomplete. Please update your understanding:
[correct explanation].

Classify each review comment:
| Type | Action |
|---|---|
| MAJOR (real bug, security) | FIX immediately. Push fix. Re-review. |
| TRIVIAL (style, nitpick) | Fix if genuinely better. Skip if bikeshed. |
| CONFLICTS WITH DESIGN | INVESTIGATE first. Only dismiss with explicit doc evidence. |
Push back only when local research EXPLICITLY addresses the tradeoff. When in doubt, the reviewer might be right.
Severity assessment:
- HIGH = real bug → must fix before merge
- MEDIUM = valid improvement → fix if straightforward
- LOW = style/nitpick → fix only if genuinely better
- INFO = skip
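The severity ladder maps cleanly onto a dispatch table. `classify` is illustrative only; the labels mirror the list above.

```shell
# Severity -> action dispatch, mirroring HIGH/MEDIUM/LOW/INFO above.
classify() {
  case "$1" in
    HIGH)   echo "must fix before merge" ;;
    MEDIUM) echo "fix if straightforward" ;;
    LOW)    echo "fix only if genuinely better" ;;
    INFO)   echo "skip" ;;
    *)      echo "unknown severity: $1"; return 1 ;;
  esac
}

classify HIGH   # prints "must fix before merge"
classify INFO   # prints "skip"
```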
Teaching Each Reviewer (permanent compounding knowledge)
Every PR is an opportunity to make reviewers smarter. Use the right format for each.
| Reviewer | How it learns | Reply format | What persists |
|---|---|---|---|
| CodeRabbit | @coderabbitai replies → explicit Learnings | @coderabbitai [explain design]. See [doc]. Please learn this for future reviews. | Permanent Learning applied to ALL future reviews on this repo |
| Greptile | Observes all reply patterns passively | Any plain reply explaining the design decision | Updates internal preference model — stops flagging dismissed patterns |
| Macroscope | macroscope.md file in repo root (no reply-learning) | Add rule to macroscope.md file | Persists as a repo-level rule, referenced in every future review |
CodeRabbit (explicit — most powerful):
@coderabbitai This uses child→parent references (K8s ownerReferences pattern).
See phase5-v2-synthesis.md Decision 2. The parent is reconstructed from children,
not stored as a separate record. Please learn this for future reviews.
# Bad format — teaches nothing:
I'll leave this as-is. ← CodeRabbit will flag it again on the next PR.
Greptile (passive — just reply naturally):
In our domain layer, we prefer detailed functions for clarity — length rules don't apply here.
# Greptile reads this and stops flagging long functions in domain/ going forward.
Macroscope (file-based — add to macroscope.md):
## macroscope.md
- Child→parent references are intentional (K8s ownerReferences pattern)
- Domain layer functions may be long for clarity — don't flag length in src/domain/
- We don't document internal utilities with JSDoc

Rule: When any reviewer comments on intentional design → always reply with context. Never just "I'll leave this." The reply compounds knowledge across every future PR.
Multi-Round Loop (minimum 2 rounds before merge)
Round 1: Push fixes → request re-review from all bots
Round 2: Read re-review → fix any new issues found
Round 3+: Only if new issues surfaced. Max 3 rounds for nitpicks.
CodeRabbit auto-re-reviews on new pushes. For others, comment @bot re-review explicitly.
If reviewer finds new issues in round 2 → fix and go to round 3. Never merge with open issues.
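The round discipline can be simulated to make the stopping rules concrete. `review_rounds` is a sketch: each argument is the open-issue count observed after a re-review; in practice that count comes from reading `gh pr view <N> --comments` yourself.

```shell
# Round-loop sketch: merge only at zero open issues, cap nitpick churn at 3 rounds.
review_rounds() {
  local round=1 max_rounds=3
  for issues in "$@"; do
    echo "round $round: $issues open issue(s)"
    if [ "$issues" -eq 0 ]; then
      echo "mergeable after round $round"
      return 0
    fi
    if [ "$round" -ge "$max_rounds" ]; then
      echo "round cap hit: stop bikeshedding, decide explicitly"
      return 1
    fi
    round=$((round + 1))
  done
}

review_rounds 2 0   # fixes land in round 1, round 2 comes back clean
```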
Sanitize Before PR (CRITICAL)
Never put real client data in public PRs:
- ❌ Real phone numbers, JIDs, group names, client names
- ❌ Real Supabase row IDs or user UUIDs
- ✅ Realistic but fake examples:
+1-555-0123, client-abc-123, Group: Example Co
Sanitize in PR description, code comments, test fixtures, and commit messages.
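A mechanical sweep catches the obvious leaks before they reach a public PR. `pii_scan` is a sketch with example patterns (UUIDs and US phone numbers, with 555 fakes allowed through); extend it with your own identifier patterns, kept out of the repo.

```shell
# PII sweep sketch: stdin in, suspicious lines out. Empty output = clean.
pii_scan() {
  grep -nE '[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}|\+1-[0-9]{3}-[0-9]{4}' \
    | grep -v '+1-555-'   # 555 numbers are the sanctioned fakes
}

# Usage: git diff main...HEAD | pii_scan
printf 'user: 123e4567-e89b-12d3-a456-426614174000\n' | pii_scan || true   # flagged
printf 'call +1-555-0123\n' | pii_scan || true                             # clean
```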
After addressing reviews:
git add <files> && git commit -m "fix: address review feedback"
git push
# Wait for re-review — CodeRabbit auto-triggers, others need manual @mention

Only THEN merge (after minimum 2 review rounds):
# Verify reviews are actually in before merging
gh pr view <N> --json reviewDecision,comments
gh pr merge <N> --squash --delete-branch
git checkout main && git pull

After Merge: Update Tracking (MANDATORY)
Every merged PR MUST update its tracking. No exceptions.
- Collab file — If this PR is part of a collab, update the task board status to ✅ Done with PR number
- Roadmap — If this PR completes a roadmap phase, update ~/Gits/orchestrator/roadmap/README.md
- BrainLayer — brain_store what changed and why (tagged pr-merged, <project>)
WRONG: Merge PR, exit silently ← Tracking drift!
WRONG: "I'll update the collab later" ← You won't. Do it NOW.
WRONG: Only update one of collab/roadmap/BL ← Update ALL relevant trackers.
If you are an autonomous agent, this step is NON-NEGOTIABLE. The orchestrator should NEVER discover completed work by accident.
What NOT To Do
WRONG: gh pr create && gh pr merge --auto ← No review!
WRONG: gh pr create && gh pr merge --squash ← Same message, no review!
WRONG: "PR created, done!" ← Mission is MERGED, not created
WRONG: Skip review "because it's a small change" ← Small changes break things too
Finishing a Branch (Alternative Endings)
Not every branch goes through the full PR flow. When implementation is done:
- Verify tests pass before offering options
- Present options:
| Option | When to Use | Commands |
|---|---|---|
| Create PR (default) | Most cases — full review loop | Continue with steps 7-11 above |
| Merge locally | Small team, already reviewed | git checkout main && git merge <branch> && git branch -d <branch> |
| Keep as-is | Need to park work | Just stop. Worktree preserved. |
| Discard | Wrong approach, start over | Requires typed "discard" confirmation. git branch -D <branch> |
Worktree cleanup: For options 1 (after merge), 3 (never), and 4 (after discard):
# Check if in worktree
git worktree list | grep "$(git branch --show-current)"
# If yes, after merging/discarding:
git worktree remove <worktree-path>

After Merge: Store Component Reasoning
For every NEW file > 50 lines created in this PR:
- Read ~/Gits/orchestrator/standards/component-reasoning-template.md
- Fill in the schema for the new component
- Run brain_store with the filled schema
- Tag: ["component-reasoning", "{repo}", "{file-slug}", "pr-{number}"]
Why this matters: Future Claude sessions spend zero time opening files to understand "why was X built this way." The reasoning is in BrainLayer, queryable in <1 second.
Threshold: Files > 50 lines or files with non-obvious architecture decisions (why pure function? why no LLM calls? why merged instead of split?).
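The threshold check can be scripted so no new file slips through. The pipeline below is a sketch: it assumes main is the base branch, and `needs_reasoning` is a hypothetical helper encoding the 50-line rule.

```shell
# 50-line rule from above, as a predicate.
needs_reasoning() {
  [ "$1" -gt 50 ]
}

# Real usage: list NEW files in this branch that need a component-reasoning entry.
#   git diff --name-only --diff-filter=A main...HEAD | while read -r f; do
#     needs_reasoning "$(wc -l < "$f")" && echo "store reasoning for $f"
#   done
needs_reasoning 51 && echo "51 lines: store reasoning"
needs_reasoning 50 || echo "50 lines: below threshold"
```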
Composability
This skill is referenced by:
- /large-plan — every phase goes through this loop
- /commit — commit is step 5, this skill is the FULL loop
- Collab files — all autonomous work requires this loop
This skill references:
- /commit — step 5 (commit with CodeRabbit review)
- /superpowers:test-driven-development — step 2 (implement with TDD)
- /superpowers:verification-before-completion — step 4 (verify before claiming)
- /never-fabricate — never claim review is green without reading it
Quick Reference
# The whole loop in commands:
git checkout main && git pull
git checkout -b feat/my-feature
# ... implement with TDD ...
bun test # or npm test
git add src/changed-file.ts tests/new-test.ts
git commit -m "feat: description
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>"
git push -u origin feat/my-feature
gh pr create --title "feat: description" --body "## Summary\n..."
# WAIT for review (60-90s for bots, or run coderabbit:code-reviewer)
gh pr view <N> --comments # READ the review
# Fix any real bugs, push again if needed
gh pr merge <N> --squash --delete-branch
git checkout main && git pull

- Best Pass Rate: 100% (Claude Sonnet)
- Assertions: 15 (2 models tested)
- Avg Cost / Run: $0.0163 (across models)
- Fastest (p50): 1.8s (Claude Sonnet)
Behavior Evals
Phase 2 baseline — skill quality on Claude
Adapter Evals
Phase 2C — cross-AI portability
| Assertion | Claude Sonnet | Codex (GPT-5.4) | Consensus |
|---|---|---|---|
| mission-is-merged-not-pr-created | 2/2 | — | — |
| fresh-verification-before-shipping | 2/2 | — | — |
| review-is-read-before-merge | 2/2 | — | — |
| main-is-cleaned-up-after-merge | 2/2 | — | — |
| real-bugs-fixed-before-merge | 2/2 | — | — |
| review-comments-are-classified | 2/2 | — | — |
| substantive-fixes-trigger-rereview | — | 1/2 | — |
| false-positives-are-not-blockers | — | 1/2 | — |
| rejects-pr-created-as-done | 2/2 | — | — |
| review-remains-required-for-small-change | — | 1/2 | — |
| completion-still-includes-merge-and-cleanup | — | 1/2 | — |
| only-true-capability-gaps-are-marked-na | — | 0/2 | — |
| manual-gh-comment-trigger-is-allowed-as-fallback | — | 0/2 | — |
| brainlayer-postmerge-remains-a-real-gap | — | 1/2 | — |
| polling-fallback-uses-real-cli-commands | — | 1/2 | — |
Token Usage and Cost per Run
| Model | Input Tokens | Output Tokens | Cost / Run | Cost / 1K Runs |
|---|---|---|---|---|
| Claude Sonnet | 1,800 | 600 | $0.0082 | $8.20 |
| Codex (GPT-5.4) | 2,200 | 750 | $0.0245 | $24.50 |
Response Time
| Model | p50 | p95 | Overhead |
|---|---|---|---|
| Claude Sonnet | 1.8s | 3.2s | +78% |
| Codex (GPT-5.4) | 2.8s | 4.6s | +64% |
Last evaluated: 2026-03-12 · Real Phase 2 evals · behavior (Claude) + adapter (1 CLI)
Changelog entries are derived from eval runs and skill version updates. Full cascading changelog (Phase 4D) coming soon.
- Initial release to Golems skill library
- 25 assertions across 7 eval scenarios
- Eval fixtures included