What are the risks of building AI systems in production?
Short answer: Seven risks matter, none are existential, and all have known mitigations. The teams that get burned aren't the ones who hit unknown problems — they're the ones who didn't bother with the known mitigations.
1. Cost runaway
Risk: One bad prompt or feature flag triples your bill before anyone notices.
Mitigation: Per-feature token budgets, per-request cost tracing, alerts on spend velocity (not just totals). Treat AI cost like any other resource — observability + guardrails. Build effort: 1 day, prevents the most common surprise in our incident research.
2. Silent prompt regression
Risk: Vendor pushes a "minor" model update; your output structure changes; downstream parsers break.
Mitigation: Pin model versions explicitly. Run CI evals on every prompt change. Replay production traces against new model versions before cutover. The cheapest insurance you can buy.
3. Sensitive data leakage
Risk: PII or trade secrets get logged or sent to third-party providers without DPAs.
Mitigation: Redaction gateway between your app and any LLM. Vendor allowlist. Per-prompt audit log. For regulated data: private inference (vLLM in your VPC, or Bedrock/Vertex with private endpoints).
4. Hallucination in user-facing outputs
Risk: Model invents a citation, a price, a contract clause. Customer sees it. Trust gone.
Mitigation: Ground every claim in retrieval. Constrain output formats with schemas. Show the model's source. For high-stakes outputs: human-in-the-loop or confidence-gated automation.
5. Vendor lock-in
Risk: Your prompts and tool definitions are tied to one provider. Switching costs are 2 quarters of work.
Mitigation: LLM gateway abstraction (LiteLLM, Portkey, or your own thin wrapper). Provider-neutral prompt format. Provider-neutral tool calling format (MCP helps here). You don't need to multi-vendor; you need to be able to.
6. Agent loops with no termination
Risk: Agent retries the same broken tool call forever. We saw $4,200 burned in 6 hours.
Mitigation: Hard iteration limits per agent run. Cost circuit breaker per session. Idempotency keys on tool calls so retries don't compound damage. Budget alarm at 50% of expected daily spend.
7. Security vulnerabilities in tool servers
Risk: MCP server treated as an internal script. No auth. No rate limiting. Eventually pointed at production data.
Mitigation: Treat every MCP/tool server as a production service. OAuth scopes per consumer. Rate limits. Audit logs. Penetration test before exposing to agents that touch external systems.
What's not on this list (and shouldn't be)
- "AI replacing engineers." Not a production risk; it's a workforce question.
- "AI becoming sentient." Not a 2026 concern.
- "Models getting worse over time." Real but rare (4% of incidents in our research). Caught by evals.
The risk profile, summarized
Production AI is no riskier than any other distributed system if engineered like one. The teams who get burned are the ones who treat AI features as a special exemption from the discipline they apply to other production code. They aren't.
Want a risk register tailored to your build? It's a deliverable in our assessment phase.