Evaluation framework for healthcare AI agents - benchmarking billing, triage, documentation, prior auth, and clinical reasoning workflows
python benchmark healthcare fhir ehr triage ai-safety evaluation-framework patient-safety fastapi medical-ai medical-coding clinical-ai healthcare-ai prior-authorization llm-evaluation healthcare-agents agent-evaluation clinical-documentation refusal-accuracy
Updated Apr 13, 2026 - Python