RoboDev

Kubernetes-native AI coding agent harness. Orchestrate Claude Code, OpenAI Codex, Aider, and OpenCode to perform maintenance and development tasks on your codebases — autonomously, at scale, with enterprise-grade guard rails.

5-Minute Quick Start

Get RoboDev running locally with Docker Compose — no Kubernetes required.

Docker Compose quick start

Deploy on Kubernetes

Install with Helm, configure GitHub Issues + Claude Code, and run your first task.

Kubernetes quick start

Six Layers of Guard Rails

Controller validation, engine hooks, repo-level rules, task profiles, quality gates, and a progress watchdog.

Guard rails overview

Extensible Plugin System

Ticketing, notifications, secrets, SCM, approvals, and reviews — all pluggable via gRPC.

Writing a plugin

How It Works

graph LR
    A["🎫 Issue Tracker<br/>(GitHub / GitLab / Jira)"] --> B["🎛️ RoboDev Controller"]
    B --> C["🛡️ Guard Rails"]
    C --> D["⚙️ Engine Selector"]
    D --> E["📦 K8s Job<br/>(AI Agent Pod)"]
    E --> F["🔀 Pull Request"]
    F --> G["✅ Review & Merge"]

RoboDev watches your issue tracker for labelled tickets, validates them against configurable guard rails, spins up a sandboxed AI coding agent in a Kubernetes Job, and opens a pull request with the result. The flow is automated end to end, reported through Prometheus metrics and structured logs, and constrained by the guard rails.
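The flow above can be sketched in Go as a short chain of steps. This is a minimal illustration under stated assumptions, not RoboDev's controller code: the `Ticket`, `GuardRail`, and `JobSpec` types, the `robodev` trigger label, and the `process` function are all hypothetical names chosen for the example.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// Ticket is a hypothetical, simplified view of an issue-tracker ticket.
type Ticket struct {
	ID     string
	Title  string
	Labels []string
}

// GuardRail is a hypothetical validation step; a non-nil error rejects the ticket.
type GuardRail func(Ticket) error

// JobSpec is a hypothetical stand-in for the Kubernetes Job the controller would create.
type JobSpec struct {
	Name   string
	Engine string
}

// hasLabel reports whether the ticket carries the given label.
func hasLabel(t Ticket, label string) bool {
	for _, l := range t.Labels {
		if l == label {
			return true
		}
	}
	return false
}

// requireTitle is an example guard rail that rejects tickets with a blank title.
func requireTitle(t Ticket) error {
	if strings.TrimSpace(t.Title) == "" {
		return errors.New("ticket has no title")
	}
	return nil
}

// process mirrors the diagram: label filter, guard rails, engine choice, job spec.
// The boolean is false when the ticket is not labelled for automation.
func process(t Ticket, trigger string, rails []GuardRail, engine string) (JobSpec, bool, error) {
	if !hasLabel(t, trigger) {
		return JobSpec{}, false, nil
	}
	for _, rail := range rails {
		if err := rail(t); err != nil {
			return JobSpec{}, true, fmt.Errorf("guard rail rejected %s: %w", t.ID, err)
		}
	}
	return JobSpec{Name: "agent-" + strings.ToLower(t.ID), Engine: engine}, true, nil
}

func main() {
	t := Ticket{ID: "ISSUE-42", Title: "Bump dependencies", Labels: []string{"robodev"}}
	job, matched, err := process(t, "robodev", []GuardRail{requireTitle}, "claude-code")
	fmt.Println(job, matched, err)
}
```

In this sketch an unlabelled ticket is skipped without error, while a labelled ticket that fails a guard rail returns an error and never reaches the job-building step.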

Key Features

| Feature | Description |
| --- | --- |
| Multi-engine | Claude Code, Codex, Aider, OpenCode, with automatic fallback chains |
| Defence in depth | Six independent guard rail layers prevent unsafe agent behaviour |
| Plugin architecture | Extend ticketing, notifications, secrets, SCM, approvals, and reviews via gRPC |
| Kubernetes-native | Operator pattern with Karpenter integration and KEDA scaling; leader election is on the roadmap |
| Observable | Prometheus metrics, structured JSON logging, Grafana dashboards |
| Multi-tenant | Multi-tenancy config schema is defined; namespace-per-tenant runtime isolation is planned for a future release |
| Security-capable | Distroless images, read-only filesystems, drop-all capabilities; NetworkPolicies and gVisor sandboxing are available but disabled by default |
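To make the "automatic fallback chains" idea concrete, here is a minimal Go sketch that tries engines in order and returns the first successful result. The `Engine` interface, `stubEngine`, and `runWithFallback` are hypothetical; RoboDev's real `ExecutionEngine` interface lives in `pkg/engine/` and may differ substantially.

```go
package main

import (
	"errors"
	"fmt"
)

// Engine is a hypothetical interface used only for this sketch.
type Engine interface {
	Name() string
	Run(task string) (string, error)
}

// stubEngine is a test double that either succeeds or fails on every call.
type stubEngine struct {
	name string
	fail bool
}

func (e stubEngine) Name() string { return e.name }

func (e stubEngine) Run(task string) (string, error) {
	if e.fail {
		return "", errors.New("unavailable")
	}
	return e.name + " completed: " + task, nil
}

// runWithFallback tries each engine in order and returns the first success.
// If every engine fails, the error wraps the last failure seen.
func runWithFallback(chain []Engine, task string) (string, error) {
	if len(chain) == 0 {
		return "", errors.New("empty fallback chain")
	}
	var lastErr error
	for _, e := range chain {
		out, err := e.Run(task)
		if err == nil {
			return out, nil
		}
		lastErr = fmt.Errorf("%s: %w", e.Name(), err)
	}
	return "", fmt.Errorf("all %d engines failed, last error: %w", len(chain), lastErr)
}

func main() {
	chain := []Engine{
		stubEngine{name: "claude-code", fail: true},
		stubEngine{name: "codex"},
	}
	out, err := runWithFallback(chain, "fix lint warnings")
	fmt.Println(out, err)
}
```

The order of the slice is the fallback policy: the first entry is the preferred engine, and later entries are used only when earlier ones return an error.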

Intelligent Agent Management

Beyond basic orchestration, RoboDev includes subsystems intended to improve agent performance over time:

| Subsystem | Status | What It Does |
| --- | --- | --- |
| Real-Time Agent Coaching (PRM) | Active | Scores agent productivity at each tool call and intervenes with guidance before problems escalate |
| Episodic Memory | Active | Accumulates knowledge across all tasks: prior failures, repo quirks, and engine strengths feed into future prompts |
| LLM Abstraction | Active | DSPy-inspired typed signatures, composable modules, and budget-aware LLM calls for all subsystems |
| Causal Diagnosis | Active | Classifies why a task failed and generates targeted corrective instructions for retry |
| Adaptive Watchdog | Active | Learns what "normal" looks like per repo/engine/task type and adjusts anomaly thresholds |
| Intelligent Routing | Active | Routes tasks to the engine most likely to succeed based on historical data |
| Cost Estimation | Active | Predicts cost and duration before launch, for example "Predicted: $12-18, 45-90 min" |
| Competitive Execution | Active | Runs multiple engines in parallel, judges results, selects the best solution |
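As one way to picture routing "based on historical data", the Go sketch below picks the engine with the highest smoothed success rate. It is an assumption-laden illustration, not the algorithm in `internal/routing/`: the `Stats` type, the Laplace smoothing in `score`, and `pickEngine` are all hypothetical.

```go
package main

import "fmt"

// Stats holds hypothetical per-engine outcome counts for one repo or task type.
type Stats struct {
	Successes int
	Failures  int
}

// score returns a Laplace-smoothed success rate, so an engine with no
// history starts at 0.5 instead of 0 or 1.
func score(s Stats) float64 {
	return float64(s.Successes+1) / float64(s.Successes+s.Failures+2)
}

// pickEngine returns the candidate with the highest score.
// Candidates are scanned in order, so ties go to the earlier entry.
// The boolean is false when there are no candidates.
func pickEngine(candidates []string, history map[string]Stats) (string, bool) {
	best, bestScore, found := "", -1.0, false
	for _, name := range candidates {
		// A missing map entry yields the zero Stats value, i.e. no history.
		if sc := score(history[name]); sc > bestScore {
			best, bestScore, found = name, sc, true
		}
	}
	return best, found
}

func main() {
	history := map[string]Stats{
		"claude-code": {Successes: 8, Failures: 2},
		"codex":       {Successes: 1, Failures: 0},
	}
	name, ok := pickEngine([]string{"aider", "codex", "claude-code"}, history)
	fmt.Println(name, ok)
}
```

Smoothing keeps a single lucky run from outranking an engine with a long, mostly successful record; a production router would likely also weigh task type, cost, and recency.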

Project Layout

The repository is organised as follows:

cmd/robodev/              — Main entrypoint for the controller binary
internal/                 — Private packages used only by the controller
  controller/             — controller-runtime reconciler (reconciliation loop)
  jobbuilder/             — Translates ExecutionSpecs into Kubernetes Jobs
  sandboxbuilder/         — Sandbox CR builder (gVisor / Kata runtime classes)
  taskrun/                — TaskRun state machine, idempotency, and store
  watchdog/               — Progress watchdog loop (loop/stall/thrash detection)
  agentstream/            — Real-time NDJSON streaming from agent pods
  config/                 — Configuration loading and validation
  metrics/                — Prometheus metric definitions
  webhook/                — Webhook receiver (GitHub / GitLab / Slack / Shortcut / generic)
  secretresolver/         — Task-scoped secret resolution and policy enforcement
  promptbuilder/          — Prompt construction with task profiles and workflows
  prm/                    — Process Reward Model for real-time agent coaching
  memory/                 — Episodic memory with temporal knowledge graph
  diagnosis/              — Causal failure diagnosis and informed retry
  routing/                — Engine fingerprinting and intelligent task routing
  estimator/              — Predictive cost and duration estimation
  tournament/             — Competitive execution with tournament selection
pkg/                      — Public packages importable by plugins and SDKs
  engine/                 — ExecutionEngine interface and built-in engine implementations
  plugin/                 — gRPC plugin host and all six plugin interfaces
proto/                    — Protobuf definitions (source of truth for all interfaces)
charts/robodev/           — Helm chart for deploying RoboDev on Kubernetes
docker/                   — Dockerfiles for the controller and each engine
  controller/             — Multi-stage build producing a distroless controller image
  engine-claude-code/     — Claude Code engine container image
  engine-codex/           — OpenAI Codex engine container image
  engine-opencode/        — OpenCode engine container image
  engine-cline/           — Cline engine container image (community template)
examples/                 — Example configurations and plugin implementations
  github-slack/           — GitHub Issues + Slack notification example values
  gitlab-teams/           — GitLab Issues + Microsoft Teams example values
  enterprise/             — Enterprise deployment patterns
  karpenter/              — Karpenter NodePool example for agent workloads
  keda/                   — KEDA ScaledObject example for queue-based scaling
  plugins/                — Example third-party plugins (Jira, Teams)
hack/                     — Developer scripts and local development helpers
  kind-config.yaml        — Kind cluster configuration for local development
  values-dev.yaml         — Helm values overlay for local dev deployments
  values-live.yaml        — Helm values overlay for live end-to-end testing
  values-test.yaml        — Helm values overlay for integration test deployments
  setup-secrets.sh        — Interactive script to provision K8s secrets for testing
  run-integration-tests.sh — Orchestrated test runner with markdown report output
scripts/                  — Utility scripts (dependency installation)
tests/                    — Test suites
  integration/            — Integration tests (no cluster required)
  e2e/                    — End-to-end tests (require a Kind cluster)
docs/                     — Documentation source (this site)
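The `internal/taskrun/` entry mentions a TaskRun state machine with idempotency. A minimal Go sketch of that pattern follows; the state names, the transition table, and the `Transition` function are assumptions made for illustration and are not taken from the RoboDev source.

```go
package main

import "fmt"

// State is a hypothetical TaskRun phase; the real state names may differ.
type State string

const (
	Pending   State = "Pending"
	Running   State = "Running"
	Succeeded State = "Succeeded"
	Failed    State = "Failed"
)

// allowed lists the legal transitions out of each state (assumed for this sketch).
// Succeeded has no entry, so it is terminal.
var allowed = map[State][]State{
	Pending: {Running},
	Running: {Succeeded, Failed},
	Failed:  {Pending}, // retry
}

// Transition returns the new state, or the unchanged state and an error if
// the move is not allowed. Moving to the current state is a no-op, which
// keeps repeated reconcile calls idempotent.
func Transition(from, to State) (State, error) {
	if from == to {
		return from, nil
	}
	for _, next := range allowed[from] {
		if next == to {
			return to, nil
		}
	}
	return from, fmt.Errorf("illegal transition %s -> %s", from, to)
}

func main() {
	s := Pending
	for _, next := range []State{Running, Running, Succeeded, Running} {
		var err error
		s, err = Transition(s, next)
		fmt.Println(s, err)
	}
}
```

Treating a repeated request for the current state as a no-op is what lets a reconciler safely re-run after a crash or a duplicate event.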

Licence

RoboDev is released under the Apache 2.0 licence.