Skip to main content

2 posts tagged with "architecture"

View All Tags

Building a Self-Hosted AI Platform from Scratch

· 5 min read

Most AI platforms assume you are comfortable sending your data to someone else's servers. For a growing number of organizations, that assumption is wrong. Regulated industries, defense contractors, research labs, and privacy-conscious companies need AI capabilities where no data leaves their network. That is the problem AI-in-a-Box was built to solve.

Current architecture: This post describes the product thesis and early architecture. The current service map, auth model, subagent model, and receipt system are documented in the Architecture reference, Authentication reference, and Audit Trail reference.

Claude Code-Style Agents: Sub-Agent Spawning and Skill Loading

· 5 min read

Claude Code has a pattern we wanted to replicate: when a task is complex enough, the main agent spawns a focused sub-agent with a custom system prompt and a curated set of tools. The sub-agent does its work and returns results to the parent. This keeps the main agent's context clean and lets specialized work happen in isolation.

We built this into AI-in-a-Box's agent runtime using the OpenAI Agents SDK.

Current runtime model: This post is historical. The current implementation uses a main agent with a Delegate tool and subagent definitions in deploy/config/subagents/*.md; the old YAML handoff team model has been removed. Use the Agents reference and Multi-Agent tutorial for current behavior.