Build, test & deploy AI agents — from YAML to production.
HoloDeck is an open-source experimentation platform for AI agents. Define agents in YAML, run hypothesis-driven tests, and deploy production-ready APIs with a simple CLI — no custom infra required.
What is HoloDeck?
HoloDeck is a no-code agent development platform for engineering teams. Describe agents in YAML, run tests and evaluations from the CLI, and ship them as FastAPI services — locally or in your own cloud.
No-Code Agents
Models, prompts, tools, memory, and vector stores defined in YAML.
Hypothesis Testing
Turn real user journeys into structured test cases and evals.
Integrated Metrics
Groundedness, relevance, F1, BLEU, ROUGE, and more in one CLI.
Deploy Anywhere
holodeck deploy converts configs to FastAPI services.
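To make one of the metrics listed above concrete, here is a minimal sketch of token-overlap F1 between an agent response and a reference answer. It assumes simple lowercase whitespace tokenization, and the function name `token_f1` is hypothetical. It illustrates the metric itself and is not HoloDeck's implementation.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-level F1 between a response and a reference answer.

    Illustrative only: whitespace tokenization is an assumption, and this
    is not taken from HoloDeck's code.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()

    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)

    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(token_f1("the cat", "the cat sat"))
```

A response that repeats only part of the reference scores high on precision but lower on recall, which is why F1 is often reported alongside metrics such as groundedness and relevance.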
Designed for how engineers build agents
HoloDeck covers the full lifecycle — definition, testing, orchestration, deployment, and observability — with a CLI-first, YAML-native approach.
holodeck test, holodeck experiment, and holodeck deploy are CLI-native and CI-friendly.
From idea to production in three steps
Go from a hypothesis about an agent to a monitored, production-ready API in minutes — not weeks.
Multi-agent systems without bespoke glue code
Real systems use more than one agent. HoloDeck gives you declarative orchestration patterns so you can model routers, specialists, and evaluators in YAML.
Use orchestration patterns inspired by the Microsoft Agent Framework to chain agents for parsing, extraction, summarization, compliance checks, and more — without inventing another internal framework.
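The router, specialist, and evaluator roles described above can be sketched in plain Python. This is only an illustration of the pattern: every name here (`route`, `SPECIALISTS`, `evaluate`, `run_pipeline`) is hypothetical, the "agents" are ordinary functions standing in for model calls, and none of it reflects HoloDeck's YAML schema or the Microsoft Agent Framework API.

```python
from typing import Callable, Dict

# An "agent" here is just a function from text to text (a stand-in for a model call).
Agent = Callable[[str], str]


def summarizer(text: str) -> str:
    # Hypothetical specialist: keep only the first sentence.
    return text.split(".")[0].strip() + "."


def extractor(text: str) -> str:
    # Hypothetical specialist: pull out tokens that contain digits.
    tokens = (tok.strip(".,") for tok in text.split())
    return ", ".join(tok for tok in tokens if any(c.isdigit() for c in tok))


SPECIALISTS: Dict[str, Agent] = {"summarize": summarizer, "extract": extractor}


def route(task: str) -> str:
    # Router: choose a specialist by keyword; defaults to summarization.
    return "extract" if "extract" in task.lower() else "summarize"


def evaluate(output: str) -> bool:
    # Evaluator: a trivial check that the specialist produced some text.
    return bool(output.strip())


def run_pipeline(task: str, text: str) -> Dict[str, object]:
    # Chain the three roles: route, run the specialist, then evaluate.
    name = route(task)
    output = SPECIALISTS[name](text)
    return {"agent": name, "output": output, "passed": evaluate(output)}


if __name__ == "__main__":
    print(run_pipeline("Extract the amounts", "Invoice 42 totals 300 USD."))
```

In a declarative setup, the mapping from task to specialist and the evaluation step would live in configuration rather than code, which is the glue a YAML-based orchestration layer aims to remove.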
Built for engineering teams — not click-through demos
Platform & ML Engineers
Standardize how agents are defined, tested, and deployed across teams. Give developers a shared YAML contract instead of bespoke scripts.
Product & Application Engineers
Ship AI features quickly without owning prompt infrastructure. Focus on user experience, not wiring models and tools.
Data & Research Teams
Run structured experiments, compare agent variants, and evaluate responses using metrics tuned to your domain.
Modern AI Products
Customer support agents, research assistants, code assistants, sales agents, and more, all defined in YAML and deployed with a CLI.
Observe, measure, and improve together
OpenTelemetry & cost tracking
HoloDeck emits traces, metrics, and logs that follow the OpenTelemetry semantic conventions for generative AI. Track latency, token usage, and cost per agent and per environment.
Open source & MIT licensed
HoloDeck is built in the open. Join the community, open issues, propose features, and help shape the agent platform you want to use.
View the repo →