AI Agents for healthcare: How to Automate multi-agent systems (single-agent with LangGraph)
Healthcare teams spend too much time moving information between intake, prior auth, utilization review, claims, and patient communications. That work is repetitive, rules-heavy, and expensive, which makes it a good fit for AI agents — but in healthcare, the safer pattern is usually a single orchestrator agent built with LangGraph rather than a swarm of independent agents.
The goal is not to let models “run the business.” The goal is to automate bounded workflows like benefits verification, chart summarization, denial triage, and patient follow-up while keeping humans in control of clinical and compliance decisions.
The Business Case
- •
Prior authorization turnaround drops from 2-5 days to same-day triage
- •A single LangGraph-based agent can extract CPT/ICD-10 codes, check payer requirements, assemble missing documentation, and route exceptions.
- •In practice, teams see 30-50% less manual coordinator time per case and fewer back-and-forth calls with providers.
- •
Claims denials decrease by 10-20% on targeted workflows
- •Most avoidable denials come from missing modifiers, eligibility mismatches, or incomplete documentation.
- •An agent that validates claim packets before submission can reduce rework and lower appeal volume by 15-25% on high-volume service lines.
- •
Clinical documentation support saves 5-10 minutes per encounter
- •For care management or ambulatory settings, summarizing chart history, medications, labs, and recent encounters into structured notes cuts administrative drag.
- •At scale, that translates into hundreds of staff hours per month for a mid-sized health system.
- •
Patient outreach response times improve from hours to minutes
- •Appointment reminders, prep instructions, referral status updates, and post-discharge follow-ups can be automated with strict templates.
- •This reduces no-show rates by 3-8% in many outpatient programs and improves call center deflection without replacing staff.
Architecture
A production healthcare setup should be boring and auditable. Use one orchestrating agent with explicit state transitions instead of letting multiple agents negotiate with each other.
- •
1. Orchestrator layer: LangGraph
- •Use LangGraph to model the workflow as a state machine: intake → classify → retrieve evidence → draft response → human review → finalize.
- •This gives you deterministic control points for HIPAA-sensitive steps and makes audit trails easier to explain to compliance teams.
- •
2. Reasoning and tool layer: LangChain
- •Use LangChain for tool calling against EHR APIs, claims systems, scheduling systems, and document stores.
- •Keep tools narrow:
get_patient_summary,check_eligibility,fetch_policy_rules,draft_denial_response.
- •
3. Retrieval layer: pgvector + structured data
- •Store payer policies, SOPs, clinical templates, and prior authorizations in PostgreSQL with pgvector for semantic retrieval.
- •Pair vector search with hard filters on payer name, plan type, state, effective date, and service line so the model does not hallucinate policy applicability.
- •
4. Governance layer: audit logs + access controls
- •Log every prompt, retrieved document ID, tool call, output version, and human override.
- •Enforce role-based access control through your identity provider and keep PHI scoped to minimum necessary access under HIPAA.
A practical stack looks like this:
| Layer | Example Technologies | Why it matters |
|---|---|---|
| Workflow orchestration | LangGraph | Deterministic multi-step automation |
| Agent tooling | LangChain | Controlled API integrations |
| Retrieval | PostgreSQL + pgvector | Policy/document lookup with traceability |
| Data sources | Epic/Cerner APIs, claims platform, CRM | Real operational context |
| Security/compliance | SOC 2 controls, KMS encryption, audit logs | Required for enterprise healthcare procurement |
If you operate across the EU or handle EU resident data, add GDPR controls for lawful basis, retention limits, and data subject rights. If you are in a payer-adjacent environment handling financial risk workflows alongside healthcare operations, align operational controls with the same discipline you would expect under Basel-style governance: clear approvals, traceability, and exception handling.
What Can Go Wrong
- •
Regulatory risk: PHI leakage or unsafe automation
- •A model can expose protected health information if prompts are logged carelessly or retrieval scopes are too broad.
- •Mitigation: tokenize or redact where possible; keep PHI inside approved systems; use least privilege; encrypt at rest/in transit; maintain BAA coverage; run periodic HIPAA security reviews.
- •
Reputation risk: incorrect patient-facing or clinical-adjacent output
- •A wrong appointment instruction or inaccurate benefits explanation creates immediate trust damage.
- •Mitigation: constrain outputs to approved templates; require human sign-off for anything clinical; use confidence thresholds; show source citations from payer policy or chart data; block free-form generation for high-risk messages.
- •
Operational risk: brittle workflows that break under edge cases
- •Healthcare data is messy: missing IDs, duplicate charts, inconsistent payer rules across states.
- •Mitigation: design fallback paths in LangGraph for exceptions; route low-confidence cases to staff; start with one workflow only; monitor exception rates daily during pilot.
Getting Started
- •
Pick one narrow workflow with measurable volume
- •Good candidates are prior auth intake for imaging/procedures, denial classification, referral routing, or post-discharge outreach.
- •Avoid anything that requires autonomous clinical judgment in phase one.
- •
Build a 6-8 week pilot with a small cross-functional team
- •You need:
- •1 product owner from operations
- •1 backend engineer
- •1 ML/agent engineer
- •1 security/compliance partner
- •1 SME from nursing revenue cycle or patient services
- •Keep the scope tight enough to validate ROI before broad rollout.
- •You need:
- •
Instrument everything before go-live
- •Track:
- •cycle time per case
- •human override rate
- •error rate by task type
- •cost per transaction
- •percentage of cases resolved without escalation
- •If you cannot measure baseline performance first week versus pilot week six, you do not have an enterprise case yet.
- •Track:
- •
Harden the system before expansion
- •Add red-team tests for PHI leakage, prompt injection, bad retrieval, and policy drift.
- •Review controls against HIPAA, GDPR where applicable, and your SOC 2 program before scaling to more departments.
The right pattern in healthcare is not “more agents.” It is one controlled agentic workflow with strong guardrails. If you start with a single LangGraph orchestrator around a well-defined operational problem, you get automation that is measurable, auditable, and realistic enough to pass security review.
Keep learning
- •The complete AI Agents Roadmap — my full 8-step breakdown
- •Free: The AI Agent Starter Kit — PDF checklist + starter code
- •Work with me — I build AI for banks and insurance companies
By Cyprian Aarons, AI Consultant at Topiax.
Want the complete 8-step roadmap?
Grab the free AI Agent Starter Kit — architecture templates, compliance checklists, and a 7-email deep-dive course.
Get the Starter Kit