AIBox
AIBox runs a self-contained AI platform from the repository's Compose files: a React frontend, a Go gateway, a set of Python services (agent runtime, guardrail, memory, knowledge, audit, sandbox), an inference router, and the backing stores (Keycloak, PostgreSQL, Redis, Qdrant, MinIO). Together they deliver chat, agent tools, RAG, memory, code execution, audit receipts, and model routing from one local deployment.
git clone --recurse-submodules https://github.com/egroup-labs/aibox.git
cd aibox
make ensure-secrets
make up
Open http://localhost. The admin console lives at
http://localhost/admin. Default-realm seed accounts
(admin, testuser) and their generated passwords are documented in the
Quickstart.
What runs
make up brings up the dev compose stack defined in deploy/docker-compose.yml
plus deploy/docker-compose.dev.yml. make up GPU=single adds local vLLM. For a
lighter laptop footprint use make up-lite, and make dev-select to choose which
services run locally (real/stub/off), driven by config/dev-bundles.toml. Everything
else is opt-in through profiles and compose overlays.
Core capabilities
| Area | What it does |
|---|---|
| Chat and agents | One adaptive main agent per session, with focused subagents spawned natively by Codex when the main agent delegates. |
| Tools | Search, scraping, knowledge, memory, code execution, MCP-backed tools, and user-approved actions. |
| Model routing | OpenAI-compatible router with OpenRouter (default) and local vLLM backends, usage capture, route metadata, and observability. |
| Memory | Typed memories scoped by tenant, user, agent, and session. Runtime prefetch injects relevant notes before the model runs. |
| Knowledge | Contextual document RAG and a markdown wiki with tenant, owner, audience, role, and visibility checks. |
| Guardrails | Input/output scanners, optional Constitutional AI, audit logging, and turn-event emission. |
| Code sandbox | Per-session execution containers behind /v1/sandbox/*, with a Docker backend. |
| Identity | Keycloak OIDC with PKCE, mobile-push 2FA, signed X-Aibox-Principal, CapTokens, and service-to-service JWTs. |
| Audit receipts | Hash-chained audit log plus signed turn envelopes, proof exports, /v1/receipts/*, and offline verification. |
Inference modes
The default make up quickstart routes models through OpenRouter, so no local
GPU is required once deploy/secrets/openrouter_api_key contains a valid key.
For local inference, run the GPU profile and route models to vLLM:
make up GPU=single # single local vLLM container
make up GPU=multi # Gemma + Qwen behind the inference router
make up GPU=vision # adds a local vision model (dev only)
No data leaves your environment unless you configure an external model provider or other outbound integration.
Next pages
- Features — capability summary and links to deeper pages.
- Quickstart — first local startup, end-to-end.
- Chat tutorial — use the primary UI.
- Upload documents — index a file and query it through RAG.
- Multi-agent — see the main agent delegate to a Codex-spawned subagent.
- Model configuration — vLLM and OpenRouter routes.
- Authentication — Keycloak, signed principals, CapTokens, internal auth.
- Audit trail — hash chain, turn envelopes, receipts, verification.
- API reference — gateway route map and request shapes.
Verified against commit 5187b91e (2026-06-11) · sources d38d8ad498a4.