Features

What Golems - Autonomous AI Agents can do

55 AI-Agnostic Skills

Same skills, any CLI — Claude, Codex, Cursor, Gemini, Kiro

Skills are written once in universal SKILL.md format, then adapted for each AI CLI via a thin adapters/ layer. A capabilities.yaml file routes each skill to the right adapters based on what each CLI supports. 40 skill eval packs with 480+ assertions ensure quality. 96% pass rate across the eval suite. The adapter layer means skills work across 5 different AI CLIs without rewriting.

55 skills — commit, pr-loop, research, orc, large-plan, and more
3-layer architecture — SKILL.md + adapters/ + capabilities.yaml
40 eval packs — 480+ assertions, fixture-based testing
5 CLIs validated — Claude, Codex, Cursor, Gemini, Kiro
96% pass rate across the full eval suite

Autonomous Coding Loop

PRD stories to working code, unattended

Ralph reads structured PRD stories and spawns fresh Claude instances to implement each one. Every commit is gated behind CodeRabbit AI review — if issues are found, Ralph fixes them automatically (up to 3 attempts). Failed fixes create BUG stories instead of shipping broken code. The cycle continues until all stories are complete.

json

{
  "id": "US-001",
  "title": "Add session export",
  "criteria": [
    "Export sessions to JSON format",
    "Include all enrichment metadata",
    "Run CodeRabbit review - must pass",
    "Commit: feat: US-001 add session export"
  ]
}

PRD story — last 2 criteria are always CodeRabbit + commit

OrcClaude v2.0

Multi-agent orchestrator with planning topology

The orchestrator agent coordinates multi-agent sprints across repos. Planning topology with response markers enables structured delegation. Spawns parallel Claude workers, monitors progress via collab files, and dispatches research to specialized agents. Sequential-parallel collab chains enable fully automated handoffs between agents — zero human intervention.

Planning topology — structured agent delegation
Collab files — async agent-to-agent communication
Sequential-parallel chains — automated handoffs
Sprint coordination — multi-repo, multi-agent
Response markers — structured output from spawned agents

Autonomous Coding Loop

Night Shift + PR Loop v2 — every commit reviewed

Night Shift scans repos at 4am for TODOs and improvements, creates worktrees, implements changes, and gates every commit behind CodeRabbit AI review. PR Loop v2 enforces review on every commit — if issues are found, they're fixed automatically (up to 3 attempts). Failed fixes create BUG stories instead of shipping broken code. Ralph reads PRD stories and spawns fresh Claude instances to implement each one.

Night Shift — 4am launchd trigger, autonomous PRs
PR Loop v2 — review enforcement on every commit
CodeRabbit gate — AI review before merge
Ralph — PRD stories to working code, unattended
3-attempt fix cycle — then BUG story, never broken code

Cloud + Local Split

Railway for cron, Mac for real-time

Railway hosts a single cloud worker running scheduled tasks: email polling (hourly), job scraping (3x/day), and daily briefing generation — all using free Gemini Flash-Lite. macOS handles everything real-time: the Telegram bot on port 3847, BrainLayer memory indexing, VoiceLayer for voice I/O, and Night Shift coding. Total cloud cost: ~$5/month.

Railway — email, jobs, briefing (scheduled)
macOS — Telegram, memory, voice (real-time)
Cloud uses Gemini (free), local uses MLX (free)
~$5/month total cloud infrastructure cost
State: Supabase for cloud, local files for Mac

MCP Server Ecosystem

8 MCP servers powering every golem

Each golem declares which MCP servers it needs. BrainLayer provides 12 memory + KG tools including the new brain_digest with 3 modes and pubsub for real-time updates. VoiceLayer exposes 2 voice tools with daemon architecture. The email server handles triage with 7 tools. Plus Supabase for database, Exa for web search, Sophtron for financial data, and GLM for local inference.

BrainLayer — 12 memory + KG tools, BrainBar daemon
Email — 7 triage & draft tools
VoiceLayer — 2 voice tools, MCP daemon
Supabase — SQL & DDL access
Exa — web search & code context
Sophtron — bank transaction APIs
GLM — local free-tier inference
Jobs — 3 discovery & matching tools

Neural Observatory Dashboard

2D canvas knowledge graph + enrichment explorer

A Next.js dashboard with d3-force 2D canvas knowledge graph visualization, enrichment observatory for search quality analysis, and wiki synthesis panels. Entity detail panels with community clustering let you explore the knowledge graph interactively. Filter panels handle 284K+ chunks without crashing.

2D canvas KG — d3-force graph with zoom and filter
Enrichment observatory — search quality explorer
Wiki synthesis — Neural Observatory Phase 1
Entity detail panels — community clustering
Aggregate-first — handles 284K+ chunks efficiently

Features

What Golems - Autonomous AI Agents can do

55 AI-Agnostic Skills

Same skills, any CLI — Claude, Codex, Cursor, Gemini, Kiro

55 skills — commit, pr-loop, research, orc, large-plan, and more
3-layer architecture — SKILL.md + adapters/ + capabilities.yaml
40 eval packs — 480+ assertions, fixture-based testing
5 CLIs validated — Claude, Codex, Cursor, Gemini, Kiro
96% pass rate across the full eval suite

Autonomous Coding Loop

PRD stories to working code, unattended

json

{
  "id": "US-001",
  "title": "Add session export",
  "criteria": [
    "Export sessions to JSON format",
    "Include all enrichment metadata",
    "Run CodeRabbit review - must pass",
    "Commit: feat: US-001 add session export"
  ]
}

PRD story — last 2 criteria are always CodeRabbit + commit

OrcClaude v2.0

Multi-agent orchestrator with planning topology

Planning topology — structured agent delegation
Collab files — async agent-to-agent communication
Sequential-parallel chains — automated handoffs
Sprint coordination — multi-repo, multi-agent
Response markers — structured output from spawned agents

Autonomous Coding Loop

Night Shift + PR Loop v2 — every commit reviewed

Night Shift — 4am launchd trigger, autonomous PRs
PR Loop v2 — review enforcement on every commit
CodeRabbit gate — AI review before merge
Ralph — PRD stories to working code, unattended
3-attempt fix cycle — then BUG story, never broken code

Cloud + Local Split

Railway for cron, Mac for real-time

Railway — email, jobs, briefing (scheduled)
macOS — Telegram, memory, voice (real-time)
Cloud uses Gemini (free), local uses MLX (free)
~$5/month total cloud infrastructure cost
State: Supabase for cloud, local files for Mac

MCP Server Ecosystem

8 MCP servers powering every golem

BrainLayer — 12 memory + KG tools, BrainBar daemon
Email — 7 triage & draft tools
VoiceLayer — 2 voice tools, MCP daemon
Supabase — SQL & DDL access
Exa — web search & code context
Sophtron — bank transaction APIs
GLM — local free-tier inference
Jobs — 3 discovery & matching tools

Neural Observatory Dashboard

2D canvas knowledge graph + enrichment explorer

2D canvas KG — d3-force graph with zoom and filter
Enrichment observatory — search quality explorer
Wiki synthesis — Neural Observatory Phase 1
Entity detail panels — community clustering
Aggregate-first — handles 284K+ chunks efficiently