Skip to content

AgenticHealthAI/Awesome-AI-Agents-for-Healthcare

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

142 Commits
 
 
 
 
 
 

Repository files navigation

Awesome AI Agents for Healthcare

Awesome PRs Welcome Read Our Survey Star on GitHub

This repository is a curated list of research papers, projects, and resources related to the application of Agentic AI / AI agents for healthcare, including medical image analysis, EHR manipulation, counseling, drug discovery, patient dialogue, and healthcare administration. AI agents refer to artificial intelligence systems that can autonomously perform tasks, make decisions, and interact with their environment, often through the use of large language models (LLMs), multi-agent systems, and tool integrations.

  1. The image below introduces a comprehensive conceptual framework. It provides a holistic view, detailing the pipeline from initial data perception and foundational agent capabilities to a hierarchical application ecosystem.

Overall Landscape

  1. We conducted a quantitative analysis of recent academic literature, with the key findings summarised in the image below. This analysis provides a data-driven snapshot of the field’s growth trajectory, technological underpinnings, and application hotspots:
  • Top Data Modalities: Textual data remains the most frequently utilized modality. Time-Series and Genomics exhibit a high proportion of publications from 2025.
  • Top Technologies: The technological focus is heavily concentrated on three topics: 1) developing highlevel Frameworks, 2) enhancing agent Reasoning, and 3) designing Multi-Agent collaboration paradigms.
  • Top Application Domains: Agentic AI continues to be widely applied in broad domains like General Medicine, Public Health, and Mental Health. Drug Discovery and Genomics are particularly new frontiers.

Statistics for Research Trends

We will try to keep this list updated. If you find any errors or any missing papers, please don't hesitate to open issues or pull requests.

📘 Read our survey paper here: A Comprehensive Survey of AI Agents in Healthcare

If you find our paper and repository helpful, please cite:

@article{xu2026comprehensive,
  title={A comprehensive survey of AI Agents in Healthcare},
  author={Xu, Gelei and Li, Xueyang and Chen, Yixiong and Duan, Yuying and Wu, Shuqing and Yu, Haoxinran and Chiu, Ching-Hao and Ni, Juntong and Tang, Ningzhi and Li, Toby Jia-Jun and others},
  journal={Journal of Biomedical Informatics},
  pages={105045},
  year={2026},
  publisher={Elsevier}
}

Table of Contents


Latest Papers

Year 2026

  1. [arxiv 2026.4] CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance [paper]
  2. [arxiv 2026.3] SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis [paper]
  3. [arxiv 2026.3] Symphony for Medical Coding: A Next-Generation Agentic System for Scalable and Explainable Medical Coding [paper]
  4. [arxiv 2026.3] Towards a Medical AI Scientist [paper] [Project]
  5. [arxiv 2026.3] Improving Clinical Diagnosis with Counterfactual Multi-Agent Reasoning [paper]
  6. [IEEE ICHI 2026] MediHive: A Decentralized Agent Collective for Medical Reasoning [paper]
  7. [arxiv 2026.3] Autonomous Agent-Orchestrated Digital Twins (AADT): Leveraging the OpenClaw Framework for State Synchronization in Rare Genetic Disorders [paper]
  8. [arxiv 2026.3] ClinicalAgents: Multi-Agent Orchestration for Clinical Decision Making with Dual-Memory [paper]
  9. [arxiv 2026.3] Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI [paper]
  10. [arxiv 2026.3] Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos [paper] [Github] [Project]
  11. [arxiv 2026.3] OMIND: Framework for Knowledge Grounded Finetuning and Multi-Turn Dialogue Benchmark for Mental Health LLMs [paper]
  12. [CHI 2026 Workshop] Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators [paper]
  13. [arxiv 2026.3] MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies [paper] [Github] [Project]
  14. [arxiv 2026.3] Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA [paper]
  15. [CVPR 2026 Findings] CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare [paper]
  16. [ML4H 2025] Dialogue to Question Generation for Evidence-based Medical Guideline Agent Development [paper]
  17. [arxiv 2026.3] From Physician Expertise to Clinical Agents: Preserving, Standardizing, and Scaling Physicians' Medical Expertise with Lightweight LLM [paper]
  18. [arxiv 2026.3] Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases [paper] [Github]
  19. [arxiv 2026.3] Cerebra: A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment [paper]
  20. [arxiv 2026.3] Agentic Automation of BT-RADS Scoring: End-to-End Multi-Agent System for Standardized Brain Tumor Follow-up Assessment [paper]
  21. [arxiv 2026.3] Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems [paper] [Github]
  22. [ICRA 2026] Anatomical Prior-Driven Framework for Autonomous Robotic Cardiac Ultrasound Standard View Acquisition [paper]
  23. [arxiv 2026.3] Position: Multi-Agent Algorithmic Care Systems Demand Contestability for Trustworthy AI [paper]
  24. [arxiv 2026.3] Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare [paper]
  25. [arxiv 2026.3] OpenHospital: A Thing-in-itself Arena for Evolving and Benchmarking LLM-based Collective Intelligence [paper]
  26. [arxiv 2026.3] MedPriv-Bench: Benchmarking the Privacy-Utility Trade-off of Large Language Models in Medical Open-End Question Answering [paper]
  27. [arxiv 2026.3] EviAgent: Evidence-Driven Agent for Radiology Report Generation [paper]
  28. [arxiv 2026.3] Six Interventions for the Responsible and Ethical Implementation of Medical AI Agents [paper]
  29. [arxiv 2026.3] TheraAgent: Multi-Agent Framework with Self-Evolving Memory and Evidence-Calibrated Reasoning for PET Theranostics [paper]
  30. [arxiv 2026.3] When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows [paper]
  31. [arxiv 2026.3] UAV-MARL: Multi-Agent Reinforcement Learning for Time-Critical and Dynamic Medical Supply Delivery [paper]
  32. [arxiv 2026.3] MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems [paper]
  33. [arxiv 2026.3] RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs [paper] [Github]
  34. [arxiv 2026.3] YAQIN: Culturally Sensitive, Agentic AI for Mental Healthcare Support Among Muslim Women in the UK [paper]
  35. [arxiv 2026.3] Empowering Locally Deployable Medical Agent via State Enhanced Logical Skills for FHIR-based Clinical Tasks [paper]
  36. [arxiv 2026.3] Computational Pathology in the Era of Emerging Foundation and Agentic AI [paper]
  37. [arxiv 2026.3] Shifting Adaptation from Weight Space to Memory Space: A Memory-Augmented Agent for Medical Image Segmentation [paper]
  38. [arxiv 2026.3] Evolving Medical Imaging Agents via Experience-driven Self-skill Discovery [paper]
  39. [arxiv 2026.3] Meissa: Multi-modal Medical Agentic Intelligence [paper] [Github]
  40. [ICLR 2026] CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounded Agentic Framework [paper]
  41. [ICLR 2026] ATPO: Adaptive Tree Policy Optimization for Multi-Turn Medical Dialogue [paper]
  42. [EACL 2026 Workshop] Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis? [paper]
  43. [arxiv 2026.3] MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus [paper]
  44. [arxiv 2026.3] MedCollab: Causal-Driven Multi-Agent Collaboration for Full-Cycle Clinical Diagnosis via IBIS-Structured Argumentation [paper]
  45. [arxiv 2026.3] From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG [paper] [Github]
  46. [arxiv 2026.3] MIND: Unified Inquiry and Diagnosis RL with Criteria Grounded Clinical Supports for Psychiatric Consultation [paper]
  47. [arxiv 2026.3] DUCX: Decomposing Unfairness in Tool-Using Chest X-ray Agents [paper]
  48. [arxiv 2026.3] OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation [paper]
  49. [arxiv 2026.3] TARSE: Test-Time Adaptation via Retrieval of Skills and Experience for Reasoning Agents [paper]
  50. [arxiv 2026.3] ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained via Reinforcement Learning [paper]
  51. [arxiv 2026.3] A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series [paper]
  52. [HealthSec/ACSAC 2026] Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study [paper]
  53. [arxiv 2026.2] 3DMedAgent: Unified Perception-to-Understanding for 3D Medical Analysis [paper]
  54. [arxiv 2026.2] Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? [paper] [Github]
  55. [arxiv 2026.2] Which Tool Response Should I Trust? Tool-Expertise-Aware Chest X-ray Agent with Multimodal Agentic Learning [paper]
  56. [arxiv 2026.2] MedClarify: An Information-Seeking AI Agent for Medical Diagnosis with Case-Specific Follow-up Questions [paper]
  57. [arxiv 2026.2] LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence in Pathology [paper]
  58. [arxiv 2026.2] NutriOrion: A Hierarchical Multi-Agent Framework for Personalized Nutrition Intervention Grounded in Clinical Guidelines [paper]
  59. [arxiv 2026.2] TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming Electronic Health Records [paper]
  60. [arxiv 2026.2] CoMMa: Contribution-Aware Medical Multi-Agents From A Game-Theoretic Perspective [paper]
  61. [AAAI 2026 Workshop] SynthAgent: A Multi-Agent LLM Framework for Realistic Patient Simulation [paper]
  62. [arxiv 2026.2] MedCoG: Maximizing LLM Inference Density in Medical Reasoning via Meta-Cognitive Regulation [paper]
  63. [arxiv 2026.2] Picking the Right Specialist: Attentive Neural Process-based Selection of Task-Specialized Models as Tools for Agentic Healthcare Systems [paper]
  64. [arxiv 2026.2] A Multi-Agent Framework for Medical AI: Leveraging Fine-Tuned GPT, LLaMA, and DeepSeek R1 for Evidence-Based and Bias-Aware Clinical Query Processing [paper]
  65. [arxiv 2026.2] MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling [paper]
  66. [arxiv 2026.2] MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs [paper]
  67. [arxiv 2026.2] Advancing AI Trustworthiness Through Patient Simulation: Risk Assessment of Conversational Agents for Antidepressant Selection [paper]
  68. [arxiv 2026.2] LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation [paper]
  69. [arxiv 2026.2] Closing Reasoning Gaps in Clinical Agents with Differential Reasoning Learning [paper]
  70. [ICHI 2026] Human-Guided Agentic AI for Multimodal Clinical Prediction: Lessons from the AgentDS Healthcare Benchmark [paper]
  71. [arxiv 2026.2] ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease [paper]
  72. [arxiv 2026.2] The Doctor Will (Still) See You Now: On the Structural Limits of Agentic AI in Healthcare [paper]
  73. [arxiv 2026.2] Agentic AI, Medical Morality, and the Transformation of the Patient-Physician Relationship [paper]
  74. [IEEE Access 2026] Agentic AI in Healthcare & Medicine: A Seven-Dimensional Taxonomy for Empirical Evaluation of LLM-based Agents [paper]
  75. [arxiv 2026.2] MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning [paper] [Github]
  76. [arxiv 2026.2] Pruning Minimal Reasoning Graphs for Efficient Retrieval-Augmented Generation [paper]
  77. [arxiv 2026.2] RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis [paper]
  78. [arxiv 2026.2] MedBeads: An Agent-Native, Immutable Data Substrate for Trustworthy Medical AI [paper]
  79. [arxiv 2026.2] AutoHealth: An Uncertainty-Aware Multi-Agent System for Autonomous Health Data Modeling [paper]
  80. [CAIN 2026] Engineering AI Agents for Clinical Workflows: A Case Study in Architecture, MLOps, and Governance [paper]
  81. [arxiv 2026.2] ExperienceWeaver: Optimizing Small-sample Experience Learning for LLM-based Clinical Text Improvement [paper]
  82. [arxiv 2026.1] EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning [paper] [Github]
  83. [arxiv 2026.1] Scaling Medical Reasoning Verification via Tool-Integrated Reinforcement Learning [paper]
  84. [arxiv 2026.1] DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data [paper]
  85. [arxiv 2026.1] AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning [paper]
  86. [arxiv 2026.1] AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization [paper]
  87. [arxiv 2026.1] MedConsultBench: A Full-Cycle, Fine-Grained, Process-Aware Benchmark for Medical Consultation Agents [paper]
  88. [EACL 2026] Knowing When to Abstain: Medical LLMs Under Clinical Uncertainty [paper]
  89. [arxiv 2026.1] Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical Imaging [paper] [Github]
  90. [arxiv 2026.1] MEDVISTAGYM: A Scalable Training Environment for Thinking with Medical Images via Tool-Integrated Reinforcement Learning [paper]
  91. [arxiv 2026.1] MedEinst: Benchmarking the Einstellung Effect in Medical LLMs through Counterfactual Differential Diagnosis [paper]
  92. [arxiv 2026.1] DemMA: Dementia Multi-Turn Dialogue Agent with Expert-Guided Reasoning and Action Simulation [paper]
  93. [arxiv 2026.1] IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation [paper]
  94. [arxiv 2026.1] Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making [paper]
  95. [arxiv 2026.1] An Explainable Agentic AI Framework for Uncertainty-Aware and Abstention-Enabled Acute Ischemic Stroke Imaging Decisions [paper]
  96. [AAAI 2026] ShortageSim: Simulating Drug Shortages under Information Asymmetry [paper]
  97. [ICLR 2026] MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale [paper] [Github]
  98. [AAAI 2026] LungNoduleAgent: A Collaborative Multi-Agent System for Precision Diagnosis of Lung Nodules [paper] [Github]
  99. [Nature Communications 2026] Wearable Intelligent Throat Enables Natural Speech in Stroke Patients with Dysarthria [paper]
  100. [npj Artificial Intelligence 2026] AI agent in healthcare: applications, evaluations, and future directions [paper]
  101. [npj Digital Medicine 2026] Benchmarking large language model-based agent systems for clinical decision tasks [paper]
  102. [npj Digital Medicine 2026] Reimagining psychiatric care with agentic AI: promise, challenges, and a roadmap forward [paper]
  103. [Nature Biotechnology 2026] Agentic AI and the rise of in silico team science in biomedical research [paper]

Year 2025

  1. [arxiv 2025.12] Hybrid-Code: A Privacy-Preserving, Redundant Multi-Agent Framework for Reliable Local Clinical Coding [paper]
  2. [arxiv 2025.12] ClinDEF: A Dynamic Evaluation Framework for Large Language Models in Clinical Reasoning [paper]
  3. [arxiv 2025.12] HARMON-E: Hierarchical Agentic Reasoning for Multimodal Oncology Notes to Extract Structured Data [paper]
  4. [arxiv 2025.12] Bidirectional human-AI collaboration in brain tumour assessments improves both expert human and AI agent performance [paper]
  5. [arxiv 2025.12] On-device Large Multi-modal Agent for Human Activity Recognition [paper]
  6. [arxiv 2025.12] Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight [paper]
  7. [arxiv 2025.12] Agent-Based Output Drift Detection for Breast Cancer Response Prediction in a Multisite Clinical Decision Support System [paper]
  8. [arxiv 2025.12] An Agentic AI Framework for Training General Practitioner Student Skills [paper]
  9. [arxiv 2025.12] ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges [paper]
  10. [arxiv 2025.12] AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning [paper]
  11. [arxiv 2025.12] A Multi-Agent Large Language Model Framework for Automated Qualitative Analysis [paper]
  12. [arxiv 2025.12] Mapis: A Knowledge-Graph Grounded Multi-Agent Framework for Evidence-Based PCOS Diagnosis [paper]
  13. [arxiv 2025.12] INFORM-CT: INtegrating LLMs and VLMs FOR Incidental Findings Management in Abdominal CT [paper]
  14. [arxiv 2025.12] Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations [paper]
  15. [arxiv 2025.12] Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis [paper]
  16. [arxiv 2025.12] MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data [paper]
  17. [arxiv 2025.12] Socratic Students: Teaching Language Models to Learn by Asking Questions [paper]
  18. [arxiv 2025.12] MedAI: Evaluating TxAgent's Therapeutic Agentic Reasoning in the NeurIPS CURE-Bench Competition [paper] [Benchmark & Competition]
  19. [arxiv 2025.12] CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment [paper] [Github]
  20. [arxiv 2025.12] AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding [paper]
  21. [arxiv 2025.12] Exploring Community-Powered Conversational Agent for Health Knowledge Acquisition: A Case Study in Colorectal Cancer [paper]
  22. [arxiv 2025.12] Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology [paper]
  23. [arxiv 2025.12] DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning [paper]
  24. [arxiv 2025.12] ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes [paper]
  25. [arxiv 2025.12] MedTutor-R1: Socratic Personalized Medical Teaching with Multi-Agent Simulation [paper] [Github]
  26. [arxiv 2025.12] MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare [paper]
  27. [arxiv 2025.12] Multi-Aspect Knowledge-Enhanced Medical Vision-Language Pretraining with Multi-Agent Data Generation [paper]
  28. [arxiv 2025.12] Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases [paper]
  29. [arxiv 2025.12] Many-to-One Adversarial Consensus: Exposing Multi-Agent Collusion Risks in AI-Based Healthcare [paper]
  30. [arxiv 2025.12] FinAgent: An Agentic AI Framework Integrating Personal Finance and Nutrition Planning [paper]
  31. [arxiv 2025.12] Radiologist Copilot: Agentic AI Assistant for Holistic Radiology Reporting with Quality Control [paper]
  32. [arxiv 2025.12] UCAgents: Unidirectional Convergence for Visual Evidence Anchored Multi-Agent Medical Decision-Making [paper]
  33. [arxiv 2025.12] First, do NOHARM: towards clinically safe large language models [paper]
  34. [arxiv 2025.12] Causal Reinforcement Learning based Agent-Patient Interaction with Clinical Domain Knowledge [paper]
  35. [arxiv 2025.11] MedEyes: Learning Dynamic Visual Focus for Medical Progressive Diagnosis [paper] [GitHub]
  36. [arxiv 2025.11] MedSAM3: Delving into Segment Anything with Medical Concepts [paper] [Github]
  37. [arxiv 2025.11] SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction [paper]
  38. [arxiv 2025.11] KOM: A Multi-Agent Artificial Intelligence System for Precision Management of Knee Osteoarthritis (KOA) [paper]
  39. [arxiv 2025.11] KRAL: Knowledge and Reasoning Augmented Learning for LLM-assisted Clinical Antimicrobial Therapy [paper]
  40. [arxiv 2025.11] Medical Malice: A Dataset for Context-Aware Safety in Healthcare LLMs [paper]
  41. [arxiv 2025.11] MedBench v4: A Robust and Scalable Benchmark for Evaluating Chinese Medical Language Models, Multimodal Models, and Intelligent Agents [paper]
  42. [arxiv 2025.11] Fair-GNE: Generalized Nash Equilibrium-Seeking Fairness in Multiagent Healthcare Automation [paper]
  43. [arxiv 2025.11] MedDCR: Learning to Design Agentic Workflows for Medical Coding [paper]
  44. [arxiv 2025.11] Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval [paper]
  45. [arxiv 2025.11] OEMA: Ontology-Enhanced Multi-Agent Collaboration Framework for Zero-Shot Clinical Named Entity Recognition [paper]
  46. [arxiv 2025.11] MedBuild AI: An Agent-Based Hybrid Intelligence Framework for Reshaping Agency in Healthcare Infrastructure Planning through Generative Design for Medical Architecture [paper]
  47. [arxiv 2025.11] From Passive to Proactive: A Multi-Agent System with Dynamic Task Orchestration for Intelligent Medical Pre-Consultation [paper]
  48. [arxiv 2025.11] Fine-Tuning DialoGPT on Common Diseases in Rural Nepal for Medical Conversations [paper]
  49. [arxiv 2025.10] Traj-CoA: Patient Trajectory Modeling via Chain-of-Agents for Lung Cancer Risk Prediction [paper]
  50. [arxiv 2025.10] FT-ARM: Fine-Tuned Agentic Reflection Multimodal Language Model for Pressure Ulcer Severity Classification with Reasoning [paper]
  51. [arxiv 2025.10] SNOMED CT-powered Knowledge Graphs for Structured Clinical Data and Diagnostic Reasoning [paper]
  52. [arxiv 2025.10] Speculative Model Risk in Healthcare AI: Using Storytelling to Surface Unintended Harms [paper]
  53. [arxiv 2025.10] MedCoAct: Confidence-Aware Multi-Agent Collaboration for Complete Clinical Decision [paper]
  54. [arxiv 2025.10] Haibu Mathematical-Medical Intelligent Agent:Enhancing Large Language Model Reliability in Medical Tasks via Verifiable Reasoning Chains [paper]
  55. [arxiv 2025.10] Reinforcement Learning for Clinical Reasoning: Aligning LLMs with ACR Imaging Appropriateness Criteria [paper]
  56. [EMNLP 2025 Industry] CLARITY: Clinical Assistant for Routing, Inference, and Triage [paper]
  57. [arxiv 2025.10] Secure Multi-Modal Data Fusion in Federated Digital Health Systems via MCP [paper]
  58. [arxiv 2025.9] AgenticAD: A Specialized Multiagent System Framework for Holistic Alzheimer Disease Management [paper]
  59. [arxiv 2025.9] Agentic-AI Healthcare: Multilingual, Privacy-First Framework with MCP Agents [paper]
  60. [arxiv 2025.9] Online Decision Making with Generative Action Sets [paper]
  61. [arxiv 2025.9] PAME-AI: Patient Messaging Creation and Optimization using Agentic AI [paper]
  62. [arxiv 2025.9] A co-evolving agentic AI system for medical imaging analysis [paper]
  63. [arxiv 2025.9] FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering [paper] [Github]
  64. [arxiv 2025.9] MedFact: Benchmarking the Fact-Checking Capabilities of Large Language Models on Chinese Medical Texts [paper] [Github]
  65. [arxiv 2025.9] Agentic Temporal Graph of Reasoning with Multimodal Language Models: A Potential AI Aid to Healthcare [paper]
  66. [arxiv 2025.9] Using AI to Optimize Patient Transfer and Resource Utilization During Mass-Casualty Incidents: A Simulation Platform [paper]
  67. [arxiv 2025.9] Demo: Healthcare Agent Orchestrator (HAO) for Patient Summarization in Molecular Tumor Boards [paper] [Github]
  68. [arxiv 2025.9] Chatbot To Help Patients Understand Their Health [paper]
  69. [arxiv 2025.9] Code Like Humans: A Multi-Agent Solution for Medical Coding [paper]
  70. [arxiv 2025.8] The Anatomy of a Personal Health Agent [paper]
  71. [arxiv 2025.8] MedResearcher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework [paper] [Github]
  72. [arxiv 2025.8] ChatThero: An LLM-Supported Chatbot for Behavior Change and Therapeutic Support in Addiction Recovery [paper]
  73. [arxiv 2025.8] Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture [paper]
  74. [arxiv 2025.8] Trustworthy Agents for Electronic Health Records through Confidence Estimation [paper]
  75. [arxiv 2025.8] AT-CXR: Uncertainty-Aware Agentic Triage for Chest X-rays [paper]
  76. [arxiv 2025.8] End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning [paper]
  77. [arxiv 2025.8] Organ-Agents: Virtual Human Physiology Simulator via LLMs [paper]
  78. [arxiv 2025.8] A Multi-Agent Approach to Neurological Clinical Reasoning [paper]
  79. [arxiv 2025.8] PASS: Probabilistic Agentic Supernet Sampling for Interpretable and Adaptive Chest X-Ray Reasoning [paper]
  80. [arxiv 2025.8] HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research [paper] [code]
  81. [arxiv 2025.8] ConfAgents: A Conformal-Guided Multi-Agent Framework for Cost-Efficient Medical Diagnosis [paper]
  82. [arxiv 2025.8] Colacare: Enhancing electronic health record modeling through large language model-driven multi-agent collaboration [paper][project page]
  83. [arxiv 2025.8] FEAT: A Multi-Agent Forensic AI System with Domain-Adapted Large Language Model for Automated Cause-of-Death Analysis [paper]
  84. [arxiv 2025.8] Are Large Language Models Dynamic Treatment Planners? An In Silico Study from a Prior Knowledge Injection Angle [paper]
  85. [arxiv 2025.8] Tree-of-Reasoning: Towards Complex Medical Diagnosis via Multi-Agent Reasoning with Evidence Tree [paper]
  86. [arxiv 2025.8] A Multi-Agent System for Complex Reasoning in Radiology Visual Question Answering [paper]
  87. [arxiv 2025.8] Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning [paper] [code]
  88. [arxiv 2025.8] Agent-Based Feature Generation from Clinical Notes for Outcome Prediction [paper]
  89. [arxiv 2025.8] GMAT: Grounded Multi-Agent Clinical Description Generation for Text Encoder in Vision-Language MIL for Whole Slide Image Classification [paper]
  90. [arxiv 2025.8] A Multi-Agent Approach to Neurological Clinical Reasoning [paper]
  91. [biorxiv 2025.8] BioScientistAgent: Designing LLM-Biomedical Agents with KG-Augmented RL Reasoning Modules for Drug Repurposing and Mechanistic of Action Elucidation [paper]
  92. [arxiv 2025.7] Agentic AI framework for end-to-end medical data inference [paper]
  93. [arxiv 2025.7] Resilient Multi-Agent Negotiation for Medical Supply Chains: Integrating LLMs and Blockchain for Transparent Coordination [paper]
  94. [arxiv 2025.7] Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication [paper]
  95. [arxiv 2025.7] A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models [paper] [project page]
  96. [arxiv 2025.7] Infherno: End-to-end agent-based FHIR resource synthesis from free-form clinical notes [paper]
  97. [arxiv 2025.7] Multi-agent retrieval-augmented framework for evidence-based counterspeech against health misinformation [paper]
  98. [arxiv 2025.7] AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions [paper]
  99. [arxiv 2025.7] Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis [paper]
  100. [arxiv 2025.7] DynamiCare: A Dynamic Multi-Agent Framework for Interactive and Open-Ended Medical Decision-Making [paper]
  101. [arxiv 2025.7] KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs [paper]
  102. [arxiv 2025.7] STELLA: Self-Evolving LLM Agent for Biomedical Research [paper] [Github]
  103. [arxiv 2025.6] MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility [paper]
  104. [arxiv 2025.6] MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning [paper]
  105. [arxiv 2025.6] From EHRs to Patient Pathways: Scalable Modeling of Longitudinal Health Trajectories with LLMs [paper]
  106. [arxiv 2025.6] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology [paper]
  107. [arxiv 2025.6] An agentic system for rare disease diagnosis with traceable reasoning [paper] [demo]
  108. [arxiv 2025.6] Standard Applicability Judgment and Cross-jurisdictional Reasoning: A RAG-based Framework for Medical Device Compliance [paper]
  109. [arxiv 2025.6] From RAG to Agentic: Validating Islamic-Medicine Responses with LLM Agents [paper]
  110. [arxiv 2025.6] PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue [paper]
  111. [arxiv 2025.6] Tiered Agentic Oversight: A Hierarchical Multi-Agent System for Healthcare Safety [paper]
  112. [arxiv 2025.6] The Optimization Paradox in Clinical AI Multi-Agent Systems [paper]
  113. [EMNLP 2025] AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents [paper]
  114. [arxiv 2025.6] AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data [paper]
  115. [arxiv 2025.6] VChatter: Exploring Generative Conversational Agents for Simulating Exposure Therapy to Reduce Social Anxiety [paper]
  116. [ACL 2025 Findings] AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation [paper] [code]
  117. [ACL 2025] ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents [paper] [Github] [Project]
  118. [arxiv 2025.6] RadFabric: Agentic AI System with Reasoning Capability for Radiology [Paper] [Project] |
  119. [arxiv 2025.5] CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents [paper] [code]
  120. [arxiv 2025.5] BehaviorSFT: Behavioral Token Conditioning for Clinical Agents Across the Proactivity Spectrum [paper]
  121. [arxiv 2025.5] Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making [paper]
  122. [NeurIPS 2025] CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic [paper]
  123. [arxiv 2025.5] Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering [paper] [code]
  124. [arxiv 2025.5] Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine [paper]
  125. [NeurIPS 2025] Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions [paper]
  126. [arxiv 2025.5] CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering [paper]
  127. [arxiv 2025.5] A Risk Taxonomy for Evaluating AI-Powered Psychotherapy Agents [paper]
  128. [NeurIPS 2025] MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks [paper] [project page]
  129. [arxiv 2025.5] A Multimodal Multi-Agent Framework for Radiology Report Generation [paper]
  130. [EMNLP 2025] DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue [paper] [code]
  131. [biorxiv 2025.5] Biomni: A general-purpose biomedical ai agent [paper]
  132. [arxiv 2025.4] Llm agent swarm for hypothesis-driven drug discovery [paper]
  133. [arxiv 2025.4] Towards a HIPAA Compliant Agentic AI System in Healthcare [paper]
  134. [arxiv 2025.4] Customizing emotional support: How do individuals construct and interact with LLM-powered chatbots [paper]
  135. [arxiv 2025.4] Privacy-Preserving Operating Room Workflow Analysis using Digital Twins [paper]
  136. [arxiv 2025.4] An LLM-Driven Multi-Agent Debate System for Mendelian Diseases [paper]
  137. [arxiv 2025.4] Txgemma: Efficient and agentic llms for therapeutics [paper]
  138. [medrxiv 2025.4] TrialGenie: Empowering Clinical Trial Design with Agentic Intelligence and Real World Data [paper]
  139. [MICCAI 2025] Operating room workflow analysis via reasoning segmentation over digital twins [paper]
  140. [arxiv 2025.3] TAMA: A Human--AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews [paper]
  141. [arxiv 2025.3] Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent [paper]
  142. [arxiv 2025.3] The Application of MATEC (Multi-AI Agent Team Care) Framework in Sepsis Care [paper]
  143. [EMNLP 2025] MDTeamGPT: A Self-Evolving LLM-Based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation [paper] [GitHub]
  144. [arxiv 2025.3] RAG-KG-IL: A Multi-Agent Hybrid Framework for Reducing Hallucinations and Enhancing LLM Reasoning through RAG and Incremental Knowledge Graph Learning Integration [paper]
  145. [arxiv 2025.3] MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways [paper]
  146. [ICASSP 2025] A Self-Evolving Framework for Multi-Agent Medical Consultation Based on Large Language Models [paper]
  147. [arxiv 2025.3] TxAgent: An AI agent for therapeutic reasoning across a universe of tools [paper]
  148. [arxiv 2025.3] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning [paper] [project page]
  149. [arxiv 2025.3] Towards conversational ai for disease management [paper]
  150. [arxiv 2025.3] GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation [paper]
  151. [EMNLP 2025 Findings] MIND: Towards Immersive Psychological Healing with Multi-Agent Inner Dialogue [paper]
  152. [arxiv 2025.2] Enhancing hepatopathy clinical trial efficiency: a secure, large language model-powered pre-screening pipeline [paper]
  153. [arxiv 2025.2] RAG-Enhanced Collaborative LLM Agents for Drug Discovery [paper]
  154. [EMNLP 2025 Findings] Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge [paper]
  155. [arxiv 2025.2] An LLM-Powered Agent for Physiological Data Analysis: A Case Study on PPG-based Heart Rate Estimation [paper]
  156. [arxiv 2025.2] Regulatory science innovation for generative AI and large language models in health and medicine: a global call for action [paper]
  157. [ACL 2025] Cami: A counselor agent supporting motivational interviewing through state inference and topic exploration [paper]
  158. [ICML 2025] MedRAX: Medical Reasoning Agent for Chest X-ray [paper] [code]
  159. [ICCV 2025] PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology [Paper] [project page] [Github]
  160. [arxiv 2025.2] M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging [paper]
  161. [NEJM AI 2025] MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents [paper] [project page]
  162. [arxiv 2025.1] AI Chatbots as Professional Service Agents: Developing a Professional Identity [paper]
  163. [arxiv 2025.1] Exploring the inquiry-diagnosis relationship with advanced patient simulators [paper] [project page]
  164. [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding [paper] [project page]
  165. [arxiv 2025.1] AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling [paper]
  166. [medrxiv 2025.1] Advancing the prediction and understanding of placebo responses in chronic back pain using large language models [paper]
  167. [Nature] Towards conversational diagnostic artificial intelligence [paper]
  168. [Nature Communications 2025] AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning [paper]
  169. [Intelligent Medicine] Evaluating large language models and agents in healthcare: key challenges in clinical applications [paper]
  170. [npj Digital Medicine] Evaluating large language models as agents in the clinic [paper]
  171. [Nature Medicine 2025] An evaluation framework for clinical use of large language models in patient interaction tasks [paper]
  172. [Nature Communications 2025] An automated framework for assessing how well LLMs cite relevant medical references [paper]
  173. [Nature BME 2025] CRISPR-GPT for agentic automation of gene-editing experiments [paper]
  174. [Nature Methods 2025] GeneAgent: self-verification language agent for gene-set analysis using domain databases [paper]
  175. [npj Digital Medicine] CARE-AD: A Multi-Agent Large Language Model Framework for Alzheimer's Disease Prediction Using Longitudinal Clinical Notes [paper]
  176. [npj Digital Medicine] Vision-language model for report generation and outcome prediction in CT pulmonary angiogram [paper]
  177. [npj Artificial Intelligence] HealthcareAgent: Eliciting the Power of Large Language Models for Medical Consultation [paper]
  178. [Scientific Reports 2025] Democratizing cost-effective, agentic artificial intelligence to multilingual medical summarization through knowledge distillation [paper]
  179. [Scientific Reports 2025] A multi-agent system based on HNC for domain-specific machine translation [paper]
  180. [biorxiv 2025.6] HEAL-KGGen: A Hierarchical Multi-Agent LLM Framework with Knowledge Graph Enhancement for Genetic Biomarker-Based Medical Diagnosis [paper]
  181. [JAMIA 2025] Improving Large Language Model Applications in Biomedicine with Retrieval-Augmented Generation: A Systematic Review, Meta-Analysis, and Clinical Development Guidelines [paper]
  182. [JAMIA Open 2025] Conversational health agents: a personalized large language model-powered agent framework [paper]
  183. [JMIR] The Effectiveness of a Custom AI Chatbot for Type 2 Diabetes Mellitus Health Literacy: Development and Evaluation Study [paper]
  184. [JMIR Aging 2025] The PDC30 Chatbot—Development of a Psychoeducational Resource on Dementia Caregiving Among Family Caregivers: Mixed Methods Acceptability Study [paper]
  185. [JoVE] Evidence-based knowledge synthesis and hypothesis validation: Navigating biomedical knowledge bases via explainable ai and agentic systems [paper]
  186. [arxiv 2024.8] Drugagent: Multi-agent large language model-based reasoning for drug-target interaction prediction [paper]
  187. [Bioinformatics 2025] ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning [paper]
  188. [Healthcare (Basel) 2025] MedScrubCrew: A Medical Multi-Agent Framework for Automating Appointment Scheduling Based on Patient-Provider Profile Resource Matching [paper]
  189. [Clinical Neurophysiology 2025] Agent-guided AI-powered interpretation and reporting of nerve conduction studies and EMG (INSPIRE) [paper]
  190. [Expert Systems with Applications 2025] A two-stage proactive dialogue generator for efficient clinical information collection using large language model [paper]
  191. [Physics in Medicine & Biology 2025] A feasibility study of automating radiotherapy planning with large language model agents [paper]
  192. [JCO 2025] A large language model (LLM)-based multi-agent framework for risk stratification and treatment recommendations in localized prostate cancer (locPCa). [paper]
  193. [ICDH] Voice-based AI Agents: Filling the Economic Gaps in Digital Health Delivery [paper]
  194. [IEEE EMBC 2025] Knowledge-infused LLM-powered conversational health agent: A case study for diabetes patients [paper]
  195. [ICLR 2025] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models [paper]
  196. [ACL 2025] Medical Graph RAG: Evidence-based Medical Large Language Model via Graph Retrieval-Augmented Generation [paper]
  197. [ACL Findings 2025] MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration [paper]
  198. [ACL Findings 2025] ASTRID--An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems [paper]
  199. [NAACL 2025] A Layered Debating Multi-Agent System for Similar Disease Diagnosis [paper]
  200. [NAACL 2025] Menti: Bridging medical calculator and llm agent with nested tool calling [paper]
  201. [COLING 2025] Unveiling performance challenges of large language models in low-resource healthcare: A demographic fairness perspective [paper]
  202. [ICMI 2025] An LLM-powered Socially Interactive Agent with Adaptive Facial Expressions for Conversing about Health [paper]
  203. [MICCAI 2025 (Oral)] WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis [Paper] [GitHub]
  204. [MICCAI 2025] Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis [Paper] [GitHub]
  205. [MICCAI 2025] DentEval: Fine-tuning-Free Expert-Aligned Assessment in Dental Education via LLM Agents [Paper] [GitHub]
  206. [MICCAI 2025] CSAP-Assist: Instrument-Agent Dialogue Empowered Vision-Language Models for Collaborative Surgical Action Planning [Paper] [GitHub]
  207. [MICCAI 2025] MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions [Paper] [Github]
  208. [MICCAI 2025 workshop] AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation [paper] [github]
  209. [ICT4AWE 2025] MentalRAG: Developing an Agentic Framework for Therapeutic Support Systems [paper]
  210. [MLHC 2025] Evaluation of Multi-Agent LLMs in Multidisciplinary Team Decision-Making for Challenging Cancer Cases [paper]
  211. [Journal of imaging informatics in medicine] AgentMRI: A Vison Language Model-Powered AI System for Self-regulating MRI Reconstruction with Multiple Degradations [paper]
  212. [COLM 2025] Can A Society of Generative Agents Simulate Human Behavior and Inform Public Health Policy? A Case Study on Vaccine Hesitancy [paper]
  213. [ACL 2025 Findings] PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation [paper] [project page]
  214. [Communications Medicine 2025] Simulated patient systems are intelligent when powered by large language model-based AI agents [paper]
  215. [AAMAS 2025] On the limits of agency in agent-based models [paper]
  216. [ACL 2025 Findings] Cod, towards an interpretable medical agent using chain of diagnosis [paper] [Github]
  217. [Advanced Intelligent Systems 2025] Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning [paper]
  218. [NeurIPS 2025] Clinicallab: Aligning agents for multi-departmental clinical diagnostics in the real world [paper]
  219. [Cell Reports Medicine 2025] Development and Testing of a Novel Large Language Model-Based Clinical Decision Support Systems for Medication Safety in 12 Clinical Specialties [paper]
  220. [PMLR 2025] KG4Diagnosis: A Hierarchical Multi-Agent LLM Framework with Knowledge Graph Enhancement for Medical Diagnosis [paper]
  221. [Nature Machine Intelligence 2025] LLM-based agentic systems in medicine and healthcare [paper]
  222. [ACL 2025 Findings] A Survey of LLM-based Agents in Medicine: How far are we from Baymax? [paper] [Github]
  223. [TechRxiv 2025] The Landscape of Medical Agents: A Survey [paper] [Github]
  224. [TechRxiv 2025] Agentic large-language-model systems in medicine: A systematic review and taxonomy [paper]
  225. [Medicine Advances 2025] Agentic large language models for healthcare: current progress and future opportunities [paper]
  226. [TechRxiv 2025] A Survey of LLM-based Multi-agent Systems in Medicine [paper]
  227. [Cell Reports Medicine 2025] A foundational architecture for AI agents in healthcare [paper]
  228. [Nature Biomedical Engineering 2025] Coordinated AI agents for advancing healthcare [paper]
  229. [Cell Reports Medicine 2025] Next-generation agentic AI for transforming healthcare [paper]
  230. [Information (MDPI) 2025] Large Language Model Agents for Biomedicine: A Comprehensive Review of Methods, Evaluations, Challenges, and Future Directions [paper]
  231. [PLOS ONE 2025] Artificial intelligence agents in healthcare research: A scoping review [paper]
  232. [npj Digital Medicine 2025] Enhancing diagnostic capability with multi-agents conversational large language models [paper] [Github]
  233. [International Journal of Medical Informatics 2025] Applications of artificial intelligence-based conversational agents in healthcare: A systematic umbrella review [paper]
  234. [HAL 2025] Scoping Review of Agentic AI Systems in Healthcare [paper]
  235. [Preprints.org 2025] AI Agents in Modern Healthcare: From Foundation to Pioneer — A Comprehensive Review and Implementation Roadmap for Impact and Integration in Clinical Settings [paper]
  236. [Asian Journal of Medical Principles and Clinical Practice 2025] Multi-Agent AI Systems in Healthcare: A Systematic Review Enhancing Clinical Decision-Making [paper]
  237. [medRxiv 2025] AI agents in clinical medicine: a systematic review [paper]
  238. [Radiology: Artificial Intelligence (RSNA) 2025] Agentic AI in Radiology: Evolution from Large Language Models to Future Clinical Integration [paper]
  239. [Indian Journal of Radiology and Imaging 2025] From chatbots to agentic workflows: ensuring responsible deployment of large language models in radiology [paper]
  240. [Bioengineering (MDPI) 2025] Agentic AI and Large Language Models in Radiology: Opportunities and Hallucination Challenges [paper]
  241. [arxiv 2025.10] Agentic systems in radiology: Design, Applications, Evaluation, and Challenges [paper]
  242. [British Journal of Radiology 2025] Agentic AI in radiology: emerging potential and unresolved challenges [paper]
  243. [Radiography 2025] Agentic systems in radiology: Principles, opportunities, privacy risks, regulation, and sustainability concerns [paper]
  244. [Tomography (MDPI) 2025] The Role of Agentic AI in Musculoskeletal Radiology: A Scoping Review [paper]
  245. [Nurse Education Today 2025] Large language model-driven agents in nursing practice: A scoping review [paper]
  246. [Communications Medicine 2025] Simulated patient systems powered by large language model-based AI agents offer potential for transforming medical education [paper]
  247. [Biocomputing 2025] Using large language models for efficient cancer registry coding in the real hospital setting: A feasibility study [paper]

Year 2024

  1. [arxiv 2024.12] PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children [paper]
  2. [IEEE Big Data] SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot [paper] [code]
  3. [Bioinformatics] AI-HOPE: an AI-driven conversational agent for enhanced clinical and genomic data integration in precision medicine research [paper]
  4. [arxiv 2024.10] IMAS: A Comprehensive Agentic Approach to Rural Healthcare Delivery [paper] [project page]
  5. [arxiv 2024.10] KGARevion: An AI Agent for Knowledge-Intensive Biomedical QA [paper] [Github] [Project]
  6. [arxiv 2024.10] Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics [paper]
  7. [arxiv 2024.9] Chatting Up Attachment: Using LLMs to Predict Adult Bonds [paper]
  8. [MLHC 2024] MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance [paper] [project page]
  9. [arxiv 2024.8] Agentic llm workflows for generating patient-friendly medical reports [paper] [project page]
  10. [ACM UIST 2024] Compeer: A generative conversational agent for proactive peer support [paper]
  11. [arxiv 2024.7] Cactus: Towards psychological counseling conversations using cognitive behavioral theory [paper]
  12. [TMI] Integration of Multi-Source Medical Data for Medical Diagnosis Question Answering [paper]
  13. [ICLR 2025 Oral] Pathgen-1.6m: 1.6 million pathology image-text pairs generation through multi-agent collaboration [paper] [project page]
  14. [arxiv 2024.7] MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control [paper]
  15. [arxiv 2024.6] Exploring llm multi-agents for icd coding [paper]
  16. [arxiv 2024.12] Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System [paper]
  17. [ICML 2024 AI for Science Workshop] TriageAgent: Towards Better Multi-Agents Collaborations for Large Language Model-Based Clinical Triage [paper]
  18. [KDD'24 Workshop] EHRFlow: A Large Language Model-Driven Iterative Multi-Agent Electronic Health Record Data Analysis Workflow [paper]
  19. [arxiv 2024.12] Agents on the Bench: Large Language Model Based Multi-Agent Framework for Trustworthy Digital Justice [paper]
  20. [MLHS 2025] Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering [paper]
  21. [NeurIPS 2024] MEDIQ: Question-Asking LLMs and a Benchmark for Medical Information-Seeking [paper] [project page]
  22. [arxiv 2024.6] CliBench: A Multifaceted and Multigranular Evaluation of Clinical Diagnosis with LLMs [paper]
  23. [arxiv 2024.5] AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments [paper]
  24. [AAAI 2025 workshop AI4Research] Drugagent: Automating ai-aided drug discovery programming through llm multi-agent collaboration [paper]
  25. [arxiv 2024.5] Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents [paper]
  26. [EMNLP 2024] Ehragent: Code empowers large language models for few-shot complex tabular reasoning on electronic health records [paper]
  27. [NeurIPS 2024 Oral] Mdagents: An adaptive collaboration of llms for medical decision-making [paper] [project page]
  28. [arxiv 2024.3] Llms-based few-shot disease predictions using ehr: A novel approach combining predictive agent reasoning and critical agent instruction [paper]
  29. [npj Digital Medicine] PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models [paper]
  30. [arxiv 2024.1] A general-purpose AI avatar in healthcare [paper]
  31. [The Lancet Digital Health] A future role for health applications of large language models depends on regulators enforcing safety standards [paper]
  32. [npj Digital Medicine] Autonomous medical evaluation for guideline adherence of large language models [paper]
  33. [Diagn Interv Radiol 2024] Large language models in radiology: fundamentals, applications, ethical considerations, risks, and future directions [paper]
  34. [PACIFIC SYMPOSIUM ON BIOCOMPUTING 2024] A conversational agent for early detection of neurotoxic effects of medications through automated intensive observation [paper]
  35. [JAMIA Open 2024] Conversational health agents: A personalized llm-powered agent framework [paper] [project page]
  36. [JMIR 2024] Mitigating cognitive biases in clinical decision-making through multi-agent conversations using large language models: simulation study [paper]
  37. [JMIR 2024] A language model--powered simulated patient with automated feedback for history taking: Prospective study [paper]
  38. [IEEE SoftCOM 2024] A multi-agent architecture for privacy-preserving natural language interaction with FHIR-based electronic health records [paper]
  39. [IEEE ISDFS 2024] Llm-based framework for administrative task automation in healthcare [paper]
  40. [IEEE Access 2024] Knowledge-Routed Automatic Diagnosis With Heterogeneous Patient-Oriented Graph [paper]
  41. [EMNLP Findings 2024] MMedAgent: Learning to Use Medical Tools with Multi-modal Agent [paper]
  42. [EMNLP 2024] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models [paper] [Github]
  43. [ACL Findings 2024] Benchmarking large language models on communicative medical coaching: a dataset and a novel system [paper]
  44. [ACL Findings 2024] Medagents: Large language models as collaborators for zero-shot medical reasoning [paper]
  45. [AAAI 2024] PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology [paper] [Github]
  46. [CHI 2024] Understanding the impact of long-term memory on self-disclosure with large language model-driven chatbots for public health intervention [paper]
  47. [CHI EA 2024] Conversational AI in health: Design considerations from a Wizard-of-Oz dermatology case study with users, clinicians and a medical LLM [paper]
  48. [ACM IMWUT 2024] Talk2Care: An LLM-based Voice Assistant for Communication between Healthcare Providers and Older Adults [paper]
  49. [ArabicNLP 2024] Synthetic arabic medical dialogues using advanced multi-agent llm techniques [paper]
  50. [ECCV Workshop 2024] Medco: Medical education copilots based on a multi-agent framework [paper]
  51. [Healthcare Information 2024] A Medical Consultation System for Geriatric Disease Based on Multi-agent Architecture and Knowledge Graph [paper]
  52. [Cell 2024] Empowering biomedical discovery with AI agents [paper] [Github]

Year 2023

  1. [NeurIPS workshop 2023] Are we going mad? benchmarking multi-agent debate between language models for medical q&a [paper]
  2. [arxiv 2023.1] Talk2Care: Facilitating asynchronous patient-provider communication with large-language-model [paper]
  3. [AMIA Annual Symposium Proceedings] Understanding the benefits and challenges of using large language model-based conversational agents for mental well-being support [paper]
  4. [Clinical NLP 2023] DERA: enhancing large language model completions with dialog-enabled resolving agents [paper] [dataset]
  5. [JMIR] The ChatGPT (generative artificial intelligence) revolution has made artificial intelligence approachable for medical professionals [paper]
  6. [JMIR] Automated monitoring of adherence to evidenced-based clinical guideline recommendations: design and implementation study [paper]
  7. [JMIR Med Educ 2023] Using ChatGPT for clinical practice and medical education: cross-sectional survey of medical students’ and physicians’ perceptions [paper]
  8. [CHI 2023] Assertiveness-based agent communication for a personalized medicine on medical imaging diagnosis [paper]

Papers by Category


1. Doctor-facing Agents

1.1 Multi-Modal Clinical Agents

(Agents designed to process and reason over multiple data types like images, text, and structured data)

Title Venue Date Paper Link Project Page
MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies arXiv 2026.03 Paper Star
GitHub
Project
Cerebra: A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment arXiv 2026.03 Paper Not Available
Shifting Adaptation from Weight Space to Memory Space: A Memory-Augmented Agent for Medical Image Segmentation arXiv 2026.03 Paper Not Available
Evolving Medical Imaging Agents via Experience-driven Self-skill Discovery arXiv 2026.03 Paper Not Available
Towards a Medical AI Scientist arXiv 2026.03 Paper Project
Meissa: Multi-modal Medical Agentic Intelligence arXiv 2026.03 Paper Star
GitHub
CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning ICLR 2026.03 Paper Project
3DMedAgent: Unified Perception-to-Understanding for 3D Medical Analysis arXiv 2026.02 Paper Not Available
CoMMa: Contribution-Aware Medical Multi-Agents From A Game-Theoretic Perspective arXiv 2026.02 Paper Not Available
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs arXiv 2026.02 Paper Not Available
Picking the Right Specialist: Attentive Neural Process-based Selection of Task-Specialized Models arXiv 2026.02 Paper Not Available
Human-Guided Agentic AI for Multimodal Clinical Prediction ICHI 2026.02 Paper Not Available
MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic RL arXiv 2026.02 Paper Star
GitHub
IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs arXiv 2026.01 Paper Not Available
MedEyes: Learning Dynamic Visual Focus for Medical Progressive Diagnosis arXiv 2025.11 Paper Star
GitHub
MedSAM3: Delving into Segment Anything with Medical Concepts arXiv 2025.11 Paper Star
GitHub
AURA: A Multi-modal Medical Agent for Understanding, Reasoning & Annotation MICCAI workshop 2025.07 Paper Star
GitHub
MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic Workflow arXiv 2025.03 Paper Star
GitHub
M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging arXiv 2025.02 Paper Star
GitHub
MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration ACL 2025 Paper Star
GitHub
MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions MICCAI 2025 Paper Star
GitHub
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making NeurIPS (Oral) 2024 Paper Star
GitHub
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent EMNLP Findings 2024 Paper Star
GitHub

1.2 Radiology Agents (CT, X-ray, MRI, etc.)

Title Venue Date Paper Link Project Page
EviAgent: Evidence-Driven Agent for Radiology Report Generation arXiv 2026.03 Paper Not Available
Agentic Automation of BT-RADS Scoring: End-to-End Multi-Agent System for Standardized Brain Tumor Follow-up Assessment arXiv 2026.03 Paper Not Available
DUCX: Decomposing Unfairness in Tool-Using Chest X-ray Agents arXiv 2026.03 Paper Not Available
Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? arXiv 2026.02 Paper Star
GitHub
Which Tool Response Should I Trust? Tool-Expertise-Aware CXR Agent with Multimodal Agentic Learning arXiv 2026.02 Paper Not Available
Perfusion Imaging and Single Material Reconstruction in Polychromatic Photon Counting CT arXiv 2026.02 Paper Star
GitHub
Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection arXiv 2026.01 Paper Star
GitHub
Explainable Agentic AI Framework for Acute Ischemic Stroke Imaging Decisions arXiv 2026.01 Paper Not Available
LungNoduleAgent: A Collaborative Multi-Agent System for Precision Diagnosis of Lung Nodules AAAI 2026.1 Paper Star
GitHub
Bidirectional human-AI collaboration in brain tumour assessments improves both expert human and AI agent performance arXiv 2025.12 Paper Not Available
INFORM-CT: INtegrating LLMs and VLMs FOR Incidental Findings Management in Abdominal CT arXiv 2025.12 Paper Not Available
Radiologist Copilot: Agentic AI Assistant for Holistic Radiology Reporting with Quality Control arXiv 2025.12 Paper Not Available
A Multi-Agent System for Complex Reasoning in Radiology Visual Question Answering arXiv 2025.08 Paper Not Available
AT-CXR: Uncertainty-Aware Agentic Triage for Chest X-rays arXiv 2025.08 Paper Star
GitHub
PASS: Probabilistic Agentic Supernet Sampling for Interpretable and Adaptive Chest X-Ray Reasoning arXiv 2025.08 Paper Star
GitHub
RadFabric: Agentic AI System with Reasoning Capability for Radiology arXiv 2025.06 Paper Project
A Multimodal Multi-Agent Framework for Radiology Report Generation arXiv 2025.05 Paper Not Available
CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering arXiv 2025.05 Paper Not Available
MedRAX: Medical reasoning agent for chest x-ray ICML 2025.02 Paper Star
GitHub
Vision-language model for report generation and outcome prediction in CT pulmonary angiogram npj Digital Medicine 2025 Paper Star
GitHub
AgentMRI: A Vison Language Model-Powered AI System for Self-regulating MRI Reconstruction with Multiple Degradations Journal of imaging informatics in medicine 2025 Paper Not Available
Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System arXiv 2024.12 Paper Not Available

1.3 Pathology Agents

Title Venue Date Paper Link Project Page
Computational Pathology in the Era of Emerging Foundation and Agentic AI -- International Expert Perspectives arXiv 2026.03 Paper Not Available
LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence arXiv 2026.02 Paper Not Available
SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction arXiv 2025.11 Paper Not Available
GMAT: Grounded Multi-Agent Clinical Description Generation for Text Encoder in Vision-Language MIL arXiv 2025.08 Paper Not Available
Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs arXiv 2025.08 Paper Star
GitHub
Evidence-based diagnostic reasoning with multi-agent copilot for human pathology arXiv 2025.06 Paper Not Available
CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis NeurIPS 2025.05 Paper Not Available
PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology ICCV 2025.02 Paper project Star
GitHub
WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis MICCAI (Oral) 2025 Paper Star
GitHub
Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering MLHS 2025 Paper Star
GitHub
Pathgen-1.6m: 1.6 million pathology image-text pairs generation through multi-agent collaboration ICLR (Oral) 2024 Paper Star
GitHub
PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology AAAI 2024 Paper Star
GitHub

1.4 Cardiovascular Imaging

Title Venue Date Paper Link Project Page
Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis MICCAI 2025.07 Paper Star
GitHub

1.5 Sonography / Ultrasound

Title Venue Date Paper Link Project Page
Anatomical Prior-Driven Framework for Autonomous Robotic Cardiac Ultrasound Standard View Acquisition ICRA 2026.03 Paper Not Available
Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication arXiv 2025.07 Paper Star
GitHub

1.6 Radiotherapy

Title Venue Date Paper Link Project Page
Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent arXiv 2025.03 Paper Not Available
A feasibility study of automating radiotherapy planning with large language model agents Physics in Medicine & Biology 2025 Paper Not Available

1.7 Dermatology

Title Venue Date Paper Link Project Page
Conversational AI in health: Design considerations from a Wizard-of-Oz dermatology case study with users, clinicians and a medical LLM CHI 'EA 2024 Paper Not Available

1.8 Dental Agents

Title Venue Date Paper Link Project Page
OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation arXiv 2026.03 Paper Not Available
DentEval: Fine-tuning-Free Expert-Aligned Assessment in Dental Education via LLM Agents MICCAI 2025 Paper Star
GitHub

1.9 Genomics & Biomarker Agents

Title Venue Date Paper Link Project Page
Autonomous Agent-Orchestrated Digital Twins (AADT): State Synchronization in Rare Genetic Disorders arXiv 2026.03 Paper Not Available
ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with LLMs Trained via RL arXiv 2026.03 Paper Not Available
Geneagent: self-verification language agent for gene-set analysis using domain databases Nature Methods 2025 Paper Star
GitHub
CRISPR-GPT for agentic automation of gene-editing experiments Nature BME 2025 Paper Star
GitHub
HEAL-KGGen: A Hierarchical Multi-Agent LLM Framework for Genetic Biomarker-Based Medical Diagnosis biorxiv 2025 Paper Star
GitHub
AI-HOPE: An AI-Driven conversational agent for enhanced clinical and genomic data integration Bioinformatics 2024.12 Paper Star
GitHub
dna-claude-analysis: AI-powered personal genome analysis agent using Claude GitHub 2025 Not Available Star
GitHub

1.10 EHR & Clinical Note Agents

Title Venue Date Paper Link Project Page
Beyond the Individual: Virtualizing Multi-Disciplinary Reasoning for Clinical Intake via Collaborative Agents ACL'26 Findings 2026.04 Paper Star
GitHub
Symphony for Medical Coding: A Next-Generation Agentic System for Scalable and Explainable Medical Coding arXiv 2026.03 Paper Not Available
Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases arXiv 2026.03 Paper Star
GitHub
From Physician Expertise to Clinical Agents: Preserving, Standardizing, and Scaling Physicians' Medical Expertise arXiv 2026.03 Paper Not Available
Empowering Locally Deployable Medical Agent via State Enhanced Logical Skills for FHIR-based Clinical Tasks arXiv 2026.03 Paper Not Available
When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows arXiv 2026.03 Paper Not Available
TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming EHRs arXiv 2026.02 Paper Not Available
AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization arXiv 2026.01 Paper Not Available
ExperienceWeaver: Optimizing Small-sample Experience Learning for Clinical Text Improvement arXiv 2026.02 Paper Not Available
Hybrid-Code: A Privacy-Preserving, Redundant Multi-Agent Framework for Reliable Local Clinical Coding arXiv 2025.12 Paper Not Available
HARMON-E: Hierarchical Agentic Reasoning for Multimodal Oncology Notes to Extract Structured Data arXiv 2025.12 Paper Not Available
ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes arXiv 2025.12 Paper Not Available
MedDCR: Learning to Design Agentic Workflows for Medical Coding arXiv 2025.11 Paper Not Available
OEMA: Ontology-Enhanced Multi-Agent Collaboration Framework for Zero-Shot Clinical Named Entity Recognition arXiv 2025.11 Paper Not Available
Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval arXiv 2025.11 Paper Not Available
Traj-CoA: Patient Trajectory Modeling via Chain-of-Agents for Lung Cancer Risk Prediction NeurIPS'25 Workshop 2025.10 Paper Not Available
Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture arXiv 2025.08 Paper Not Available
SNOW: Agent-Based Feature Generation from Clinical Notes for Outcome Prediction arXiv 2025.08 Paper Project
Trustworthy Agents for Electronic Health Records through Confidence Estimation arXiv 2025.8 Paper Star
GitHub
Infherno: End-to-end agent-based FHIR resource synthesis from free-form clinical notes arXiv 2025.07 Paper Star
GitHub
From EHRs to Patient Pathways: Scalable Modeling of Longitudinal Health Trajectories with LLMs arXiv 2025.6 Paper Not Available
CARE-AD: a multi-agent large language model framework for Alzheimer’s disease prediction npj Digital Medicine 2025 Paper Star
GitHub
Colacare: Enhancing electronic health record modeling through large language model-driven multi-agent collaboration arXiv 2024.10 Paper [project]
EHRFlow: A Large Language Model-Driven Iterative Multi-Agent Electronic Health Record Data Analysis Workflow KDD'24 Workshop 2024.06 Paper Star
GitHub
A multi-agent architecture for privacy-preserving natural language interaction with FHIR-based electronic health records IEEE SoftCOM 2024 Paper Not Available

1.11 Surgical Agents

Title Venue Date Paper Link Project Page
CSAP-Assist: Instrument-Agent Dialogue Empowered Vision-Language Models for Collaborative Surgical Action Planning MICCAI 2025 Paper Star
GitHub
Privacy-Preserving Operating Room Workflow Analysis using Digital Twins arXiv 2025.4 Paper Not Available

1.12 Education Agents

Title Venue Date Paper Link Project Page
Dialogue to Question Generation for Evidence-based Medical Guideline Agent Development ML4H 2026.03 Paper Not Available
An Agentic AI Framework for Training General Practitioner Student Skills arXiv 2025.12 Paper Not Available
MedTutor-R1: Socratic Personalized Medical Teaching with Multi-Agent Simulation arXiv 2025.12 Paper Star
GitHub
Exploring Community-Powered Conversational Agent for Health Knowledge Acquisition arXiv 2025.12 Paper Not Available

1.13 Reasoning & Multi Agent Techniques

Title Venue Date Paper Link Project Page
CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance arXiv 2026.04 Paper Not Available
Improving Clinical Diagnosis with Counterfactual Multi-Agent Reasoning arXiv 2026.03 Paper Not Available
MediHive: A Decentralized Agent Collective for Medical Reasoning IEEE ICHI 2026.03 Paper Not Available
ClinicalAgents: Multi-Agent Orchestration for Clinical Decision Making with Dual-Memory arXiv 2026.03 Paper Not Available
Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA arXiv 2026.03 Paper Not Available
CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare CVPR Findings 2026.03 Paper Not Available
Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems arXiv 2026.03 Paper Star
GitHub
TheraAgent: Multi-Agent Framework with Self-Evolving Memory for PET Theranostics arXiv 2026.03 Paper Not Available
OpenHospital: A Thing-in-itself Arena for Evolving and Benchmarking LLM-based Collective Intelligence arXiv 2026.03 Paper Not Available
MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling arXiv 2026.02 Paper Not Available
ATPO: Adaptive Tree Policy Optimization for Multi-Turn Medical Dialogue ICLR 2026.03 Paper Not Available
MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus arXiv 2026.03 Paper Not Available
MedCollab: Causal-Driven Multi-Agent Collaboration for Full-Cycle Clinical Diagnosis arXiv 2026.03 Paper Not Available
From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG arXiv 2026.03 Paper Star
GitHub
TARSE: Test-Time Adaptation via Retrieval of Skills and Experience for Reasoning Agents arXiv 2026.03 Paper Not Available
A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series arXiv 2026.03 Paper Not Available
Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis? EACL Workshop 2026.03 Paper Not Available
MedClarify: An Information-Seeking AI Agent for Medical Diagnosis arXiv 2026.02 Paper Not Available
MedCoG: Maximizing LLM Inference Density in Medical Reasoning via Meta-Cognitive Regulation arXiv 2026.02 Paper Not Available
Closing Reasoning Gaps in Clinical Agents with Differential Reasoning Learning arXiv 2026.02 Paper Not Available
A Multi-Agent Framework for Medical AI: Leveraging GPT, LLaMA, and DeepSeek R1 arXiv 2026.02 Paper Not Available
Pruning Minimal Reasoning Graphs for Efficient Retrieval-Augmented Generation arXiv 2026.02 Paper Not Available
RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis arXiv 2026.02 Paper Not Available
Agentic Reasoning for Large Language Models arXiv 2026.01 Paper Star
GitHub
EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis arXiv 2026.01 Paper Star
GitHub
Scaling Medical Reasoning Verification via Tool-Integrated Reinforcement Learning arXiv 2026.01 Paper Not Available
DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data arXiv 2026.01 Paper Not Available
Multi-Aspect Knowledge-Enhanced Medical Vision-Language Pretraining with Multi-Agent Data Generation arXiv 2025.12 Paper Not Available
Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis arXiv 2025.12 Paper Not Available
AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning arXiv 2025.12 Paper Star
Github
Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations arXiv 2025.12 Paper Not Available
Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology arXiv 2025.12 Paper Not Available
DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning arXiv 2025.12 Paper Star
Github
MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare arXiv 2025.12 Paper Not Available
Many-to-One Adversarial Consensus: Exposing Multi-Agent Collusion Risks in AI-Based Healthcare arXiv 2025.12 Paper Not Available
Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases AAAI Workshop 2025.12 Paper Not Available
UCAgents: Unidirectional Convergence for Visual Evidence Anchored Multi-Agent Medical Decision-Making arXiv 2025.12 Paper Star
GitHub
KOM: A Multi-Agent Artificial Intelligence System for Precision Management of Knee Osteoarthritis (KOA) arXiv 2025.11 Paper Not Available
KRAL: Knowledge and Reasoning Augmented Learning for LLM-assisted Clinical Antimicrobial Therapy arXiv 2025.11 Paper Not Available
MedResearcher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework arXiv 2025.8 Paper Star
GitHub
ConfAgents: A Conformal-Guided Multi-Agent Framework for Cost-Efficient Medical Diagnosis arXiv 2025.8 Paper Star
GitHub
Tree-of-Reasoning: Towards Complex Medical Diagnosis via Multi-Agent Reasoning with Evidence Tree arXiv 2025.8 Paper Star
GitHub
End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning arXiv 2025.8 Paper Star
GitHub
A Multi-Agent Approach to Neurological Clinical Reasoning arXiv 2025.8 Paper Not Available
KERAP: A knowledge-enhanced reasoning approach for accurate zero-shot diagnosis prediction arXiv 2025.7 Paper Star
GitHub
MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning arXiv 2025.06 Paper Not Available
An agentic system for rare disease diagnosis with traceable reasoning arXiv 2025.6 Paper [demo]
MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility arXiv 2025.6 Paper Not Available
The Optimization Paradox in Clinical AI Multi-Agent Systems arXiv 2025.6 Paper Star
GitHub
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue EMNLP 2025.5 Paper Star
GitHub
Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making arXiv 2025.5 Paper Not Available
MDTeamGPT: A Self-Evolving LLM-Based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation EMNLP 2025.3 Paper Star
GitHub
The Application of MATEC (Multi-AI Agent Team Care) Framework in Sepsis Care arXiv 2025.3 Paper Not Available
Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge EMNLP Findings 2025.2 Paper Star
GitHub
A Layered Debating Multi-Agent System for Similar Disease Diagnosis NAACL 2025 Paper Not Available
KG4Diagnosis: A Hierarchical Multi-Agent LLM Framework with Knowledge Graph Enhancement arXiv 2024.12 Paper Not Available
Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics arXiv 2024.10 Paper Not Available
MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning ACL 2024 Findings 2023.11 Paper Star
GitHub

2. Patient-Facing Applications

2.1 Mental Health & CBT Agents

Title Venue Date Paper Link Project Page
OMIND: Framework for Knowledge Grounded Finetuning and Multi-Turn Dialogue Benchmark for Mental Health LLMs arXiv 2026.03 Paper Not Available
YAQIN: Culturally Sensitive, Agentic AI for Mental Healthcare Support Among Muslim Women in the UK arXiv 2026.03 Paper Not Available
MIND: Unified Inquiry and Diagnosis RL for Psychiatric Consultation arXiv 2026.03 Paper Not Available
SynthAgent: A Multi-Agent LLM Framework for Realistic Patient Simulation AAAI Workshop 2026.02 Paper Not Available
Advancing AI Trustworthiness Through Patient Simulation for Antidepressant Selection arXiv 2026.02 Paper Not Available
DemMA: Dementia Multi-Turn Dialogue Agent with Expert-Guided Reasoning and Action Simulation arXiv 2026.01 Paper Not Available
CittaVerse (一念万相) AI-powered reminiscence therapy platform for dementia/MCI using narrative identity, autobiographical memory scaffolding, and 6-dimension narrative quality scoring arXiv (in prep) Paper GitHub
coTherapist: A Behavior-Aligned Small Language Model to Support Mental Healthcare Experts arXiv 2026.01 Paper Not Available
Towards Efficient and Robust Linguistic Emotion Diagnosis for Mental Health arXiv 2026.01 Paper Not Available
ChatThero: An LLM-Supported Chatbot for Behavior Change and Therapeutic Support in Addiction Recovery arXiv 2025.08 Paper Star
GitHub Reproduce
VChatter: Exploring Generative Conversational Agents for Simulating Exposure Therapy to Reduce Social Anxiety arXiv 2025.06 Paper Not Available
AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation ACL Findings 2025.06 Paper Star
GitHub
MIND: Towards Immersive Psychological Healing with Multi-Agent Inner Dialogue EMNLP Findings 2025.02 Paper Star
GitHub Reproduce
Cami: A counselor agent supporting motivational interviewing through state inference and topic exploration ACL 2025.02 Paper Star
GitHub
Autocbt: An autonomous multi-agent framework for cognitive behavioral therapy in psychological counseling arXiv 2025.01 Paper Not Available
PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children arXiv 2024.12 Paper Star
GitHub
Cactus: Towards psychological counseling conversations using cognitive behavioral theory EMNLP Findings 2024.07 Paper Star
GitHub
MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating arXiv 2024.07 Paper Star
GitHub
Compeer: A generative conversational agent for proactive peer support arXiv 2024.07 Paper Star
GitHub
Understanding the benefits and challenges of using large language model-based conversational agents for mental well-being support AMIA Annual Symposium Proceedings 2023.07 Paper Not Available

2.2 Clinical Communication & Intake Agents

Title Venue Date Paper Link Project Page
AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data arXiv 2025.6 paper Not Available
A two-stage proactive dialogue generator for efficient clinical information collection Expert Systems with Applications 2025 Paper Not Available
PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation ACL Findings 2024.11 Paper Star
GitHub
A language model--powered simulated patient with automated feedback for history taking: Prospective study JMIR 2024 Paper Not Available
Conversational health agents: a personalized large language model-powered agent framework JAMIA Open 2024 Paper Star
GitHub
Talk2Care: Facilitating asynchronous patient-provider communication with large-language-model arXiv 2023.9 Paper Not Available

2.3 Screening & Personalized Care Agents

Title Venue Date Paper Link Project Page
NutriOrion: A Hierarchical Multi-Agent Framework for Personalized Nutrition Intervention arXiv 2026.02 Paper Not Available
FinAgent: An Agentic AI Framework Integrating Personal Finance and Nutrition Planning arXiv 2025.12 Paper Not Available
On-device Large Multi-modal Agent for Human Activity Recognition arXiv 2025.12 Paper Not Available
Causal Reinforcement Learning based Agent-Patient Interaction with Clinical Domain Knowledge arXiv 2025.12 Paper Not Available
AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions arXiv 2025.07 Paper huggingface
A Conversational Agent for Early Detection of Neurotoxic Effects of Medications through Automated Intensive Observation PACIFIC SYMPOSIUM ON BIOCOMPUTING 2024 Paper Not Available

2.4 General-purpose Healthcare Avatars

Title Venue Date Paper Link Project Page
The Anatomy of a Personal Health Agent arXiv 2025.08 Paper Not Available
A general-purpose AI avatar in healthcare arXiv 2024.01 Paper Not Available

3. Drug Discovery & Development

Title Venue Date Paper Link Project Page
RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs arXiv 2026.03 Paper Star
GitHub
ALPACA: A Reinforcement Learning Environment for Medication Repurposing in Alzheimer's Disease arXiv 2026.02 Paper Not Available
Causal-Enhanced AI Agents for Medical Research Screening arXiv 2026.01 Paper Not Available
MedAI: Evaluating TxAgent's Therapeutic Agentic Reasoning in the NeurIPS CURE-Bench Competition arXiv 2025.12 Paper Benchmark & Competition
BioScientistAgent: Designing LLM-Biomedical Agents with KG-Augmented RL Reasoning Modules biorxiv 2025.08 Paper Not Available
RAG-Enhanced Collaborative LLM Agents for Drug Discovery arXiv 2025.02 Paper Not Available
Large Language Model Agent for Modular Task Execution in Drug Discovery arXiv 2025.07 Paper Star
GitHub
AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents arXiv EMNLP Paper Star
GitHub
Llm agent swarm for hypothesis-driven drug discovery arXiv 2025.04 Paper Not Available
Txgemma: Efficient and agentic llms for therapeutics arXiv 2025.04 Paper Not Available
TrialGenie: Empowering Clinical Trial Design with Agentic Intelligence and Real World Data medRxiv 2025.04 Paper Not Available
TxAgent: An AI agent for therapeutic reasoning across a universe of tools arXiv 2025.03 Paper Star
GitHub
Drugagent: Automating ai-aided drug discovery programming through llm multi-agent collaboration AAAI 2025 workshop AI4Research 2024.11 Paper Star
GitHub
Drugagent: Multi-agent large language model-based reasoning for drug-target interaction prediction arXiv 2024.08 Paper Star
GitHub
PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models npj Digital Medicine 2024.01 Paper Not Available
MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance MLHC 2024 Paper Star
GitHub

4. Healthcare Administration & Workflow

Title Venue Date Paper Link Project Page
Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare arXiv 2026.03 Paper Not Available
UAV-MARL: Multi-Agent Reinforcement Learning for Time-Critical and Dynamic Medical Supply Delivery arXiv 2026.03 Paper Not Available
Position: Multi-Agent Algorithmic Care Systems Demand Contestability for Trustworthy AI arXiv 2026.03 Paper Not Available
Six Interventions for the Responsible and Ethical Implementation of Medical AI Agents arXiv 2026.03 Paper Not Available
Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators CHI Workshop 2026.03 Paper Not Available
Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study HealthSec/ACSAC 2026.03 Paper Not Available
The Doctor Will (Still) See You Now: On the Structural Limits of Agentic AI in Healthcare arXiv 2026.02 Paper Not Available
Agentic AI, Medical Morality, and the Transformation of the Patient-Physician Relationship arXiv 2026.02 Paper Not Available
MedBeads: An Agent-Native, Immutable Data Substrate for Trustworthy Medical AI arXiv 2026.02 Paper Star
GitHub
Engineering AI Agents for Clinical Workflows: A Case Study in Architecture CAIN 2026 Paper Not Available
Agentic AI Governance and Lifecycle Management in Healthcare arXiv 2026.01 Paper Not Available
Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making arXiv 2026.01 Paper Not Available
AutoHealth: An Uncertainty-Aware Multi-Agent System for Autonomous Health Data Modeling arXiv 2026.02 Paper Not Available
Fair-GNE: Generalized Nash Equilibrium-Seeking Fairness in Multiagent Healthcare Automation arXiv 2025.11 Paper Not Available
MedBuild AI: An Agent-Based Hybrid Intelligence Framework for Reshaping Agency in Healthcare Infrastructure Planning through Generative Design for Medical Architecture arXiv 2025.11 Paper Not Available
ShortageSim: Simulating Drug Shortages under Information Asymmetry arXiv 2025.09 Paper Star
GitHub
Code Like Humans: A Multi-Agent Solution for Medical Coding arXiv 2025.09 Paper Not Available
Resilient Multi-Agent Negotiation for Medical Supply Chains: Integrating LLMs and Blockchain arXiv 2025.07 Paper Not Available
Standard Applicability Judgment and Cross-jurisdictional Reasoning: A RAG-based Framework for Medical Device Compliance arXiv 2025.06 Paper Not Available
Operating room workflow analysis via reasoning segmentation over digital twins MICCAI 2025.03 Paper Not Available
MedScrubCrew: A Medical Multi-Agent Framework for Automating Appointment Scheduling Healthcare (Basel) 2025 Paper Not Available
IMAS: A Comprehensive Agentic Approach to Rural Healthcare Delivery arXiv 2024.10 Paper Star
GitHub
Exploring llm multi-agents for icd coding arXiv 2024.06 Paper Not Available
Llm-based framework for administrative task automation in healthcare IEEE ISDFS 2024 Paper Not Available

5. Datasets & Benchmarks

Title Venue Date Paper Link Project Page
Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI arXiv 2026.03 Paper Not Available
Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos arXiv 2026.03 Paper Star
GitHub
Project
MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems arXiv 2026.03 Paper Not Available
MedPriv-Bench: Benchmarking the Privacy-Utility Trade-off of LLMs in Medical Open-End Question Answering arXiv 2026.03 Paper Not Available
OpenHospital: A Thing-in-itself Arena for Evolving and Benchmarking LLM-based Collective Intelligence arXiv 2026.03 Paper Not Available
Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases arXiv 2026.03 Paper Star
GitHub
LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation arXiv 2026.02 Paper Not Available
Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis? EACL Workshop 2026.03 Paper Not Available
MEDVISTAGYM: A Scalable Training Environment for Thinking with Medical Images arXiv 2026.01 Paper Not Available
MedConsultBench: A Full-Cycle, Fine-Grained, Process-Aware Benchmark for Medical Consultation arXiv 2026.01 Paper Not Available
MedEinst: Benchmarking the Einstellung Effect in Medical LLMs arXiv 2026.01 Paper Not Available
ART: Action-based Reasoning Task Benchmarking for Medical AI Agents arXiv 2026.01 Paper Not Available
MedDialogRubrics: A Comprehensive Benchmark for Multi-turn Medical Consultations arXiv 2026.01 Paper Not Available
Knowing When to Abstain: Medical LLMs Under Clinical Uncertainty EACL 2026 Paper Not Available
Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems arXiv 2026.01 Paper Not Available
Improving the Safety and Trustworthiness of Medical AI via Multi-Agent Evaluation Loops arXiv 2026.01 Paper Not Available
AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning arXiv 2026.01 Paper Not Available
ClinDEF: A Dynamic Evaluation Framework for Large Language Models in Clinical Reasoning arXiv 2025.12 Paper Not Available
ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges arXiv 2025.12 Paper Github
MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery arXiv 2025.12 Paper Not Available
CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment arXiv 2025.12 Paper Star
GitHub
AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding arXiv 2025.12 Paper Not Available
Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight arXiv 2025.12 Paper Not Available
First, do NOHARM: towards clinically safe large language models arXiv 2025.12 Paper Not Available
Medical Malice: A Dataset for Context-Aware Safety in Healthcare LLMs arXiv 2025.11 Paper Not Available
MedBench v4: A Robust and Scalable Benchmark for Evaluating Chinese Medical Language Models, Multimodal Models, and Intelligent Agents arXiv 2025.11 Paper Not Available
Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering arXiv 2025.05 Paper Star
GitHub
MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks NeurIPS 2025.05 Paper Star
GitHub
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning arXiv 2025.03 Paper Star
GitHub
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents NEJM AI 2025.01 Paper Star
GitHub
CliBench: A Multifaceted and Multigranular Evaluation of Clinical Diagnosis with LLMs arXiv 2024.06 Paper Star
GitHub
MediQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning arXiv 2024.06 Paper Star
GitHub
AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments arXiv 2024.05 Paper Star
GitHub
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents arXiv 2024.05 Paper Star
GitHub

6. Related Surveys

6.1 General Healthcare AI Agent Surveys

Title Venue Date Paper Link Project Page
Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators CHI Workshop 2026 Paper Not Available
Six Interventions for the Responsible and Ethical Implementation of Medical AI Agents arXiv 2026 Paper Not Available
A Comprehensive Survey of Agentic AI in Healthcare Authorea / TechRxiv 2025 Paper Star
GitHub
AI agents in clinical medicine: a systematic review medRxiv 2025 Paper Not Available
LLM-based agentic systems in medicine and healthcare Nature Machine Intelligence 2024 Paper Not Available
A Survey of LLM-based Agents in Medicine: How far are we from Baymax? ACL 2025 Findings 2025 Paper Star
GitHub
The Landscape of Medical Agents: A Survey TechRxiv 2025 Paper Star
GitHub
Agentic large-language-model systems in medicine: A systematic review and taxonomy TechRxiv 2025 Paper Not Available
Agentic large language models for healthcare: current progress and future opportunities Medicine Advances (Wiley) 2025 Paper Not Available
A Survey of LLM-based Multi-agent Systems in Medicine TechRxiv / OpenReview 2025 Paper Not Available
AI agent in healthcare: applications, evaluations, and future directions npj Artificial Intelligence 2026 Paper Not Available
A foundational architecture for AI agents in healthcare Cell Reports Medicine 2025 Paper Not Available
Coordinated AI agents for advancing healthcare Nature Biomedical Engineering 2025 Paper Not Available
Next-generation agentic AI for transforming healthcare Cell Reports Medicine 2025 Paper Not Available
Large Language Model Agents for Biomedicine: A Comprehensive Review Information (MDPI) 2025 Paper Not Available
Artificial intelligence agents in healthcare research: A scoping review PLOS ONE 2025 Paper Not Available
Benchmarking large language model-based agent systems for clinical decision tasks npj Digital Medicine 2026 Paper Not Available
Enhancing diagnostic capability with multi-agents conversational large language models npj Digital Medicine 2025 Paper Star
GitHub
Applications of artificial intelligence-based conversational agents in healthcare: A systematic umbrella review International Journal of Medical Informatics 2025 Paper Not Available
Scoping Review of Agentic AI Systems in Healthcare HAL 2025 Paper Not Available
AI Agents in Modern Healthcare: From Foundation to Pioneer Preprints.org 2025 Paper Not Available
Multi-Agent AI Systems in Healthcare: A Systematic Review Enhancing Clinical Decision-Making Asian Journal of Medical Principles and Clinical Practice 2025 Paper Not Available

6.2 Radiology-Specific Surveys

Title Venue Date Paper Link Project Page
Agentic AI in Radiology: Evolution from Large Language Models to Future Clinical Integration Radiology: Artificial Intelligence (RSNA) 2025 Paper Not Available
From chatbots to agentic workflows: ensuring responsible deployment of large language models in radiology Indian Journal of Radiology and Imaging 2025 Paper Not Available
Agentic AI and Large Language Models in Radiology: Opportunities and Hallucination Challenges Bioengineering (MDPI) 2025 Paper Not Available
Agentic systems in radiology: Design, Applications, Evaluation, and Challenges arXiv 2025 Paper Not Available
Agentic AI in radiology: emerging potential and unresolved challenges British Journal of Radiology 2025 Paper Not Available
Agentic systems in radiology: Principles, opportunities, privacy risks, regulation, and sustainability concerns Radiography 2025 Paper Not Available
The Role of Agentic AI in Musculoskeletal Radiology: A Scoping Review Tomography (MDPI) 2025 Paper Not Available

6.3 Specialty-Specific Surveys

Title Venue Date Paper Link Project Page
Computational Pathology in the Era of Emerging Foundation and Agentic AI -- International Expert Perspectives arXiv 2026 Paper Not Available
Reimagining psychiatric care with agentic AI: promise, challenges, and a roadmap forward npj Digital Medicine 2026 Paper Not Available
Large language model-driven agents in nursing practice: A scoping review Nurse Education Today 2025 Paper Not Available
Simulated patient systems powered by large language model-based AI agents offer potential for transforming medical education Communications Medicine 2025 Paper Not Available

6.4 Biomedical Research & Discovery Surveys

Title Venue Date Paper Link Project Page
Empowering biomedical discovery with AI agents Cell 2024 Paper Star
GitHub
Agentic AI and the rise of in silico team science in biomedical research Nature Biotechnology 2026 Paper Not Available

Open-Source Projects & Tools

Beyond academic research papers, several open-source projects and tools provide practical infrastructure for building healthcare AI agent systems. This section catalogs actively maintained projects, MCP servers, and frameworks.

Healthcare-Specific Agent Systems

Project Description Links
ClawdTalk Voice calling and SMS skill for AI agents enabling telephony capabilities for healthcare communication Star
GitHub | Website
Healthcare Agent Orchestrator Azure-based modular specialized agents for multi-disciplinary healthcare workflows Star
GitHub
Multi-Agent Medical Assistant GenAI-powered multi-agentic medical diagnostics chatbot with advanced RAG and medical imaging Star
GitHub
MedicalCoderSwarm Production-grade multi-agent system for medical diagnosis and coding with specialized AI agents Star
GitHub
AI-Agents-for-Medical-Diagnostics LLM-based AI agents that analyze complex medical cases by integrating specialist insights Star
GitHub
HealthGPT (Stanford) Experimental iOS app for natural language interaction with Apple Health data Star
GitHub
DoctorGPT Offline-first LLM fine-tuned on medical dialogue data that can pass the US Medical Licensing Exam Star
GitHub
MedSci Skills 32 open-source Claude Code skills for the full medical research lifecycle — anti-hallucination literature search (PubMed, Semantic Scholar, bioRxiv), meta-analysis pipeline (PROSPERO, PRISMA, QUADAS-2), reporting guideline audits (STROBE, PRISMA, STARD, CONSORT, TRIPOD+AI), statistical analysis in Python/R, publication-ready figures, study design review, grant proposals, peer review, and academic presentation prep. Three runnable end-to-end demos on public datasets. Built by a physician-researcher, MIT licensed Star
GitHub | Website
Anthropic Healthcare Skills Official healthcare skills including FHIR developer tools, prior auth review, and clinical trial protocol generation GitHub
Voice AI SDR Agent Production-ready autonomous AI phone agent for patient outreach, appointment scheduling, and healthcare communication using LangGraph and Twilio Star
GitHub
Healthcare Agents 51 open-source AI agents for US healthcare administration with MHA-level expertise across revenue cycle, compliance, quality, clinical ops, payer relations, health IT, and pharmacy. Features real regulatory citations (42 CFR, CMS) and works with Claude Code, Cursor, Codex CLI, and 10+ AI tools Star
GitHub
CittaVerse Open-source AI reminiscence therapy platform with narrative scoring, multi-session memory, and B2B2C deployment for eldercare institutions — includes core framework, auto-evolve agent, and therapy pipeline Star
GitHub
AnveVoice AI voice agent for healthcare websites — schedules appointments, answers patient FAQs, handles intake forms, and speaks 50+ languages with <700ms latency Website
OpenHealth Privacy-first AI health assistant that uses personal health data (blood tests, checkups, family history, symptoms) with local LLM support via Ollama — supports LLaMA, DeepSeek, GPT, Claude, and Gemini for personalized health insights Star
GitHub

Healthcare MCP Servers

Project Description Links
Healthcare MCP Public MCP server providing AI assistants access to FDA drug info, PubMed, clinical trials, ICD-10, and medical calculators Star
GitHub
TruthStack MCP Supplement-drug interaction safety tool for AI agents with 1,008 directed interactions and FDA adverse event signals Star
GitHub | LangChain | API
Medical MCP MCP server querying FDA, WHO, PubMed, Google Scholar, and RxNorm APIs Star
GitHub
FHIR MCP Server FHIR-compliant MCP server with full CRUD operations and LOINC integration for natural language queries Star
GitHub
Google Cloud Healthcare API MCP MCP server for Google Cloud Healthcare API FHIR resources and medical research APIs Star
GitHub
AWS HealthLake MCP Server MCP server for AWS HealthLake FHIR operations with 11 tools for FHIR resource management Documentation
Awesome Medical MCP Servers Curated collection of Medical MCP servers for healthcare data integration Star
GitHub
BGPT MCP Hosted MCP server for searching scientific papers with full-text experimental data extraction; covers biomedical, clinical, and life science studies; search_papers tool returns structured data (methods, results, sample sizes); 50 free searches Star
GitHub
Fulcra Context MCP Personal context MCP server providing unified access to biometric, sleep, activity, and calendar data for AI agents via the Fulcra Life API — enables healthcare AI applications to incorporate real-time patient-reported wellness data Star
GitHub | Python Client

| Genomic Agent Discovery | Multi-agent MCP server for genomic analysis — specialized AI agents analyze raw DNA files across 12 databases (ClinVar, GWAS, AlphaMissense, CPIC, gnomAD, etc.) and coordinate findings through shared MCP tools. Privacy-first, runs 100% locally | Star
GitHub |

Healthcare RAG & Knowledge Systems

Project Description Links
MedRAG Toolkit Systematic toolkit for Retrieval-Augmented Generation on medical QA with MIRAGE benchmark Star
GitHub
MedGraph-AI RAG agent for healthcare using LangChain with Neo4j knowledge graphs Star
GitHub

Medical Imaging & Deep Learning Frameworks

Project Description Links
MONAI PyTorch-based framework for deep learning in healthcare imaging (by NVIDIA & King's College London) Star
GitHub
PyHealth Deep learning toolkit for healthcare applications supporting patient prediction tasks Star
GitHub

Healthcare Workflow & Team Coordination Tools

Project Description Links
Taskade Open-source AI workspace for multi-agent workflow coordination, checklists, and team task automation — applicable to care-team task routing, intake triage, and SOP management Star
GitHub | Website

Related Awesome Lists

Project Description Links
Awesome Healthcare Curated list of awesome open source healthcare software, libraries, tools and resources Star
GitHub
Awesome Healthcare AI Curated list of open source healthcare tools, algorithms, datasets and research papers Star
GitHub

Acknowledgement

This awesome list is maintained by a collaborative team from the University of Notre Dame, Johns Hopkins University, and Emory University. The authors of the survey paper are Gelei Xu*, Xueyang Li*, Yixiong Chen*, Yuying Duan*, Shuqing Wu*, Alexander Yu*, Ching-Hao Chiu*, Juntong Ni*, Ningzhi Tang, Toby Jia-Jun Li, Alan Yuille, Wei Jin, and Yiyu Shi (* equal contribution).

Paper Annotations

To promote transparency and reproducibility, we provide the structured annotation sheet used in our survey, including labels for technologies, medical domains, tasks, development stages, data modalities, and evaluation metrics. View the full Google Sheet here: Link

Star History

Star History Chart