What is state machines in AI Agents? A Guide for CTOs in insurance
State machines are a way to model an AI agent as a set of states, with clear rules for how it moves from one state to another. In practice, they define what the agent is doing right now, what it can do next, and what event causes the change.
For insurance CTOs, this matters because most agent failures come from unclear process flow, not weak model output. A state machine gives you control over an AI agent’s behavior across claims intake, document checks, approvals, escalations, and handoffs.
How It Works
Think of a state machine like an insurance claim workflow board.
A claim does not go from “submitted” to “paid” in one jump. It moves through states:
- •Submitted
- •Validation
- •Pending documents
- •Under review
- •Approved
- •Rejected
- •Paid
Each state has allowed transitions. If the required document arrives, the claim moves from Pending documents to Under review. If fraud signals are detected, it may move to Manual review. If payment fails, it may move to Payment exception.
That is the core idea: the AI agent is not free-running. It is operating inside a controlled process.
For AI agents, this is especially useful because the model can be good at language but bad at orchestration. The state machine decides:
- •what the agent should ask next
- •whether it can call a tool
- •when to stop and wait for a human
- •when to retry versus escalate
- •which branch of the workflow is valid
A simple example in plain English:
- •Customer starts a motor claim.
- •Agent enters
Intakestate. - •It collects policy number, date of loss, and photos.
- •If all required data is present, move to
Validation. - •If anything is missing, move to
Awaiting customer. - •Once validated, route to
Assessment. - •If severity is low, auto-process.
- •If severity is high or uncertain, hand off to an adjuster.
This is similar to how an elevator works. It does not randomly choose floors. It only moves based on button presses and internal rules. An AI agent with a state machine behaves the same way: predictable movement, controlled transitions, no improvisation in critical paths.
Why It Matters
CTOs in insurance should care because state machines reduce operational risk in production AI systems.
- •
They make behavior predictable
Insurance workflows need auditability. A state machine gives you a traceable path for every decision: where the agent was, why it moved, and what triggered the transition.
- •
They reduce hallucination impact
The model can still generate text, but it cannot skip mandatory steps or invent process shortcuts if transitions are enforced by code.
- •
They improve human handoff
Claims and underwriting often need escalation. State machines make it explicit when the agent must pause and route to a human reviewer.
- •
They support compliance and controls
You can encode business rules such as “do not approve claims above threshold without supervisor review” or “require identity verification before policy changes.”
| Without State Machine | With State Machine |
|---|---|
| Agent may jump between tasks unpredictably | Agent follows defined workflow states |
| Harder to audit decisions | Every transition can be logged |
| More brittle under edge cases | Clear fallback paths for exceptions |
| Human handoff is ad hoc | Human escalation is part of the design |
For engineering teams, this also makes testing easier. You can test each state independently and verify that invalid transitions fail fast instead of creating silent workflow corruption.
Real Example
Consider an insurance FNOL flow for auto claims.
A customer reports an accident through chat or voice. The AI agent uses a state machine like this:
Start
-> Intake
-> Policy lookup
-> Loss validation
-> Damage assessment
-> Routing decision
-> Straight-through processing
-> Human adjuster review
-> Fraud investigation
Here’s how it works in practice:
- •In
Intake, the agent gathers date/time/location/vehicle details. - •In
Policy lookup, it checks whether coverage was active at time of loss. - •In
Loss validation, it confirms the incident matches policy terms. - •In
Damage assessment, it asks for photos and estimates severity. - •In
Routing decision, business rules decide the next step:- •low severity + clean history -> straight-through processing
- •missing evidence -> request more documents
- •suspicious patterns -> fraud queue
- •high value claim -> adjuster review
The LLM can handle conversation naturally in each state:
- •asking follow-up questions,
- •summarizing customer input,
- •explaining next steps in plain language.
But the state machine controls process integrity:
- •no payment before validation,
- •no policy update before identity verification,
- •no closure before mandatory evidence collection.
That separation is what makes this production-ready.
A common pattern looks like this:
from enum import Enum
class ClaimState(Enum):
INTAKE = "intake"
POLICY_LOOKUP = "policy_lookup"
VALIDATION = "validation"
ASSESSMENT = "assessment"
REVIEW = "review"
CLOSED = "closed"
def next_state(current_state, event):
transitions = {
ClaimState.INTAKE: {"policy_found": ClaimState.POLICY_LOOKUP},
ClaimState.POLICY_LOOKUP: {"valid_policy": ClaimState.VALIDATION},
ClaimState.VALIDATION: {"needs_review": ClaimState.REVIEW,
"auto_approve": ClaimState.CLOSED},
}
return transitions.get(current_state, {}).get(event)
In real systems you would add persistence, retries, timeout handling, idempotency keys, and audit logs. The point is not the syntax; it’s that every action has a legal place in the workflow.
Related Concepts
- •Finite State Machines (FSMs) — the basic version with fixed states and transitions.
- •Workflow orchestration — coordinating multi-step business processes across services and humans.
- •Agent memory — storing context across states without letting memory override process rules.
- •Tool calling — letting agents invoke APIs only when the current state allows it.
- •Human-in-the-loop design — routing uncertain or high-risk cases to underwriters or claims handlers.
Keep learning
- •The complete AI Agents Roadmap — my full 8-step breakdown
- •Free: The AI Agent Starter Kit — PDF checklist + starter code
- •Work with me — I build AI for banks and insurance companies
By Cyprian Aarons, AI Consultant at Topiax.
Want the complete 8-step roadmap?
Grab the free AI Agent Starter Kit — architecture templates, compliance checklists, and a 7-email deep-dive course.
Get the Starter Kit