0
Are Artificial Intelligence Agents Helpers to Doctors or Are They New Health Professionals

Are Artificial Intelligence Agents Helpers to Doctors or Are They New Health Professionals

Only 27% of healthcare workers’ time is spent on direct patient care; the rest goes to paperwork and admin tasks. AI agents promise to change this balance. Where do we stand today?

🌟 Why Did We Write This?

The time consumed by electronic records and administrative work keeps growing, fueling physician burnout. Is there a reliable way to change the picture? 🤔

The leading candidate: AI agents. Beyond chatbots, these systems can handle record management, lab interpretation, and workflow integration. Will they remain assistants that ease workload, or evolve into a “new healthcare professional”? 👩‍⚕️🤖

To explore this, we review results from MedAgentBench, a Stanford platform. Published in NEJM AI on August 14, 2025 (DOI: 10.1056/AIdbp2500144), it is the first large-scale benchmark of AI agents in realistic EHR scenarios. Below we summarize its scope and model performance. 🚀

300
Clinical Tasks
100
Patient Profiles
700,000+
Data Points
12
Models Tested

Why Is This Important?

Studies show only 27% of time goes to direct care, while paperwork, EHR entry, and admin duties dominate. This imbalance drives burnout; AI agents aim to free clinical time by taking over routine tasks.

From Chatbots to Agents

  • Interpret complex instructions and plan actions,
  • Integrate information from multiple sources,
  • Interact with EHRs via standard APIs,
  • Execute step by step and present summaries to physicians.

Example: Beyond answering “what is pneumonia treatment?”, an agent can factor allergies, antibiogram data, drug interactions, and risk scores to prepare a personalized plan as a draft order.

📊 MedAgentBench: The First Medical AI Agent Benchmark

MedAgentBench, developed at Stanford, evaluates agents across realistic EHR workflows; 300 tasks and 100 patient profiles in a FHIR-compliant environment.

🧪 Lab Result Query 💊 Medication Ordering 📝 Data Entry 📈 Data Integration 📨 Referrals 🗒️ Documentation 🔍 Patient Info Retrieval

📈 Model Success Rates — Interactive

Switch views: Overall SR, Query SR, Action SR.

Success Rate (%)

Tip: Hover over bars for exact percentages. Click a model name to lock/unlock for comparison.

Model Overall SR (%) Query SR (%) Action SR (%)
Claude 3.5 Sonnet v2 69.67 85.33 54.00
GPT-4o 64.00 72.00 56.00
DeepSeek-V3 62.67 70.67 54.67
Gemini 1.5 Pro 62.00 52.67 71.33
GPT-4o mini 56.33 59.33 53.33
Qwen2.5 (72B) 51.33 38.67 64.00

⚠️ Common Errors

  • Not following required output format (e.g., returning text instead of numbers),
  • Invalid/incorrect API calls (payload or syntax errors),
  • Incomplete understanding of clinical context.

👩‍⚕️ Can They Replace Doctors?

Agents are not ready to replace physicians yet, but they already work as assistants by handling paperwork, order entry, and simple queries—freeing clinical time. With better reliability and standards, they may become a new category of healthcare professional.

🚀 Looking Ahead

  • Improved reproducibility and reliability,
  • Richer datasets including clinical notes and team collaboration,
  • Clear ethical, safety, and regulatory frameworks.

🔗 Source & Reference

MedAgentBench: A Virtual EHR Environment to Benchmark Medical LLM Agents — NEJM AI (Published: August 14, 2025). DOI: 10.1056/AIdbp2500144. GitHub: stanfordmlgroup/MedAgentBench

Sağlık ve Mutlulukla Kalın...

Sayfada yer alan yazılar sadece bilgilendirme amaçlıdır, tanı ve tedavi için mutlaka doktorunuza başvurunuz.

Kanser tanısına sahip bir hasta için online muayene randevusu hakkında bilgi almak için aşağıdaki formu doldurabilirsiniz.


İlgili Haberleri


Yapay Zeka ile Saptanan İNSÜLİN DİRENCİ 12 Kanser Türünün Riskini Artırıyor

Yapay Zeka ile Saptanan İNSÜLİN DİRENCİ 12 Kanser Türünün Riskini Artırıyor

Nature Communications • Şubat 2026 KANSERİN GİZLİ BİYOLOJİK MOTORU Yapay...

Tıpta Dijital Otonomi Dönemi: Utah Reçete Yetkisini Yapay Zekaya Devreden İlk Eyalet Oldu

Tıpta Dijital Otonomi Dönemi: Utah Reçete Yetkisini Yapay Zekaya Devreden İlk Eyalet Oldu

Sağlık Bilişimi ve Politika • 2026 Sağlık sistemindeki tıkanıklığı aşmak...

Dr. Google'dan Dr. ChatGPT'ye: Kanser Sürecinde Dijital Yol Arkadaşlığı ve Siberkondria

Dr. Google'dan Dr. ChatGPT'ye: Kanser Sürecinde Dijital Yol Arkadaşlığı ve Siberkondria

Onkoloji pratiğinde devrim yaratan Yapay Zeka ve İnternet; kanser hastaları...

OpenAI Neden 100 Milyon Dolar'a Torch'u Satın Aldı? Tıbbi Hafızanın Doğuşu

OpenAI Neden 100 Milyon Dolar'a Torch'u Satın Aldı? Tıbbi Hafızanın Doğuşu

Sağlığın Yeniden İnşası: Üretken Yapay Zeka ve Biyolojik Tasarım Çağı...

Hakkımda

Özgeçmişim, kanser tanı ve tedavisine dair çalışmalarım ve ilgi alanlarım için tıklayın.

Prof. Dr. Mustafa Özdoğan Hakkında