• About
  • Masthead
  • License Content
  • Advertise
  • Submit Press Release
  • RSS/Email List
  • 2MM Podcast
  • Write for us
  • Contact Us
2 Minute Medicine
No Result
View All Result

No products in the cart.

SUBSCRIBE
  • Specialties
    • All Specialties, All Recent Reports
    • Cardiology
    • Chronic Disease
    • Dermatology
    • Emergency
    • Endocrinology
    • Gastroenterology
    • Imaging and Intervention
    • Infectious Disease
    • Nephrology
    • Neurology
    • Obstetrics
    • Oncology
    • Ophthalmology
    • Pediatrics
    • Pharma
    • Preclinical
    • Psychiatry
    • Public Health
    • Pulmonology
    • Rheumatology
    • Surgery
  • Tools
    • EvidencePulse™
    • RVU Search
    • NPI Registry Lookup
  • Pharma
  • AI News
  • The Scan+
  • Classics™+
    • 2MM+ Online Access
    • Paperback and Ebook
  • Rewinds
  • Partners
    • License Content
    • Submit Press Release
    • Advertise with Us
  • Account
    • Subscribe
    • Sign-in
    • My account
2 Minute Medicine
  • Specialties
    • All Specialties, All Recent Reports
    • Cardiology
    • Chronic Disease
    • Dermatology
    • Emergency
    • Endocrinology
    • Gastroenterology
    • Imaging and Intervention
    • Infectious Disease
    • Nephrology
    • Neurology
    • Obstetrics
    • Oncology
    • Ophthalmology
    • Pediatrics
    • Pharma
    • Preclinical
    • Psychiatry
    • Public Health
    • Pulmonology
    • Rheumatology
    • Surgery
  • Tools
    • EvidencePulse™
    • RVU Search
    • NPI Registry Lookup
  • Pharma
  • AI News
  • The Scan+
  • Classics™+
    • 2MM+ Online Access
    • Paperback and Ebook
  • Rewinds
  • Partners
    • License Content
    • Submit Press Release
    • Advertise with Us
  • Account
    • Subscribe
    • Sign-in
    • My account
SUBSCRIBE
2 Minute Medicine
Subscribe
Home All Specialties

ChatGPT able to work through clinical vignettes with promising accuracy

byKiera Liblik
August 23, 2023
in All Specialties
Reading Time: 2 mins read
0
Share on FacebookShare on Twitter

1. In this clinical decision support study, Chat Generative Pre-training Transformer (ChatGPT) was presented with decision-making questions based on Merck Sharpe & Dohme (MSD) Clinical Manual scenarios, with an overall performance of 71.1% accuracy. 

2. In terms of prompt type, ChatGPT had the highest performance on final diagnosis questions (76.9% accuracy) and the lowest performance on initial differential diagnosis questions (60.3% accuracy).

Evidence Rating Level: 1 (Excellent)

Study Rundown: Artificial intelligence has gained increasing popularity in health care, with potential applications in individual patient care. Specifically, artificial intelligence may be leveraged to aid in clinical decision-making tasks such as developing a differential diagnosis. ChatGPT is an autoregressive large language model that can extract data from sources across the internet to form responses to use inputs. Accordingly, the current study assessed the performance of ChatGPT version 3.5 in answering questions about MSD Clinical Manual Vignettes. Experts in the field scored outputs as per the MSD Manual answer guide. ChatGPT was most likely to score correctly on final diagnosis questions, with the lowest scores on initial differential diagnosis questions. Accuracy did not vary significantly based on the acuity of the clinical presentation, patient age, or patient gender presented in the clinical vignette. It was identified that ChatGPT has constraints in clinical judgment, including difficulty with medication dosing. Further, the study is limited by the inaccessibility of ChatGPT’s training data, which may include the MSD Clinical Manual. The results of this study suggest that ChatGPT may be able to support clinicians in solving clinical vignettes and making decisions about patient care, with specific utility in establishing a final diagnosis.

Click to read the study in the Journal of Medical Internet Research

In-Depth [clinical decision support study]: This was a clinical support study that evaluated the performance of ChatGPT version 3.5 in correctly answering questions related to patient vignettes from the MSD Clinical Manual. The primary outcome of interest was the accuracy of ChatGPT overall. Results were also stratified by type of question, acuity of clinical presentation, and patient demographics (age, gender). A total of 36 clinical vignettes were used, with the exclusion of questions involving image analysis. Each clinical vignette was tested in three separate ChatGPT sessions, with two independent individuals scoring the answers. Notably, there were no scoring discrepancies throughout the study. Data were analyzed via multivariable linear regression. Overall, ChatGPT scored an accuracy of 71.8% (range 55.9% to 83.8%). The average score across question types varied from 60.3% for initial differential diagnosis to 76.9% for final diagnosis, suggesting that performance improves directly with increased data input. When assessing for performance by patient demographics, no significant difference was found based on age (p=0.35) or gender (p=0.59). Similarly, the acuity of the clinical vignettes, as assessed by the Emergency Severity Index, did not significantly impact accuracy (p=0.55). Finally, it was identified that the majority of medication errors were due to incorrect dosing. In summary, ChatGPT was able to solve clinical vignettes with improved accuracy as more information was introduced. Limitations were noted in the assessment of initial differential diagnosis as well as medication dosing.

RELATED REPORTS

PrescriberPoint AI agent automates prior authorization with 94.5% acceptance

2MM: AI Roundup – PrescriberPoint launches autonomous agent for prior authorization, Utah begins first in nation autonomous AI prescribing pilot, and JAMA study confirms ambient AI scribes return hours to the clinician work week

Large language models (LLMs) performed poorly in navigating early clinical diagnostic uncertainty

Image: PD

©2023 2 Minute Medicine, Inc. All rights reserved. No works may be reproduced without expressed written consent from 2 Minute Medicine, Inc. Inquire about licensing here. No article should be construed as medical advice and is not intended as such by the authors or by 2 Minute Medicine, Inc.

Tags: All specialtiesartificial intelligenceChat Generative Pre-training TransformerchatGPTclinical decision support studyclinical decision-makingtechnology
Previous Post

Primary hepatectomy may be superior to conventional hepatectomy for primary hepatocellular carcinoma

Next Post

Transcranial direct current stimulation does not improve outcomes for major depressive disorder

RelatedReports

Record-based algorithm may improve lung cancer screening follow-up
AI Roundup

PrescriberPoint AI agent automates prior authorization with 94.5% acceptance

May 12, 2026
2MM: AI Roundup- AI Cancer Test, Smarter Hospitals, Faster Drug Discovery, and Mental Health Tech [May 2nd, 2025]
AI Roundup

2MM: AI Roundup – PrescriberPoint launches autonomous agent for prior authorization, Utah begins first in nation autonomous AI prescribing pilot, and JAMA study confirms ambient AI scribes return hours to the clinician work week

May 11, 2026
No obesity paradox found between BMI, stroke, and death
Artificial Intelligence

Large language models (LLMs) performed poorly in navigating early clinical diagnostic uncertainty

April 20, 2026
American Academy of Pediatrics recommends standards for adverse event disclosures
AI Roundup

Brown University study warns of systemic ethical risks in artificial intelligence therapy chatbots

April 10, 2026
Next Post
Children’s hospital visits for suicide ideation and attempts are increasing

Transcranial direct current stimulation does not improve outcomes for major depressive disorder

Inappropriate hospital admission as a risk factor for the subsequent development of adverse events

The 2 Minute Medicine Podcast Episode 15

The 2 Minute Medicine Podcast Episode 22

2 Minute Medicine® is an award winning, physician-run, expert medical media company. Our content is curated, written and edited by practicing health professionals who have clinical and scientific expertise in their field of reporting. Our editorial management team is comprised of highly-trained MD physicians. Join numerous brands, companies, and hospitals who trust our licensed content.

Recent Reports

  • Food coloring additives are associated with higher incidence of type 2 diabetes
  • Children conceived through infertility treatments may have similar growth compared to naturally conceived children
  • Ring-augmented one-anastomosis gastric bypass may not improve weight loss compared to conventional one-anastomosis gastric bypass
License Content
Terms of Use | Disclaimer
Cookie Policy
Privacy Statement (EU)
Disclaimer
  • Specialties
    • All Specialties, All Recent Reports
    • Cardiology
    • Chronic Disease
    • Dermatology
    • Emergency
    • Endocrinology
    • Gastroenterology
    • Imaging and Intervention
    • Infectious Disease
    • Nephrology
    • Neurology
    • Obstetrics
    • Oncology
    • Ophthalmology
    • Pediatrics
    • Pharma
    • Preclinical
    • Psychiatry
    • Public Health
    • Pulmonology
    • Rheumatology
    • Surgery
  • Tools
    • EvidencePulse™
    • RVU Search
    • NPI Registry Lookup
  • Pharma
  • AI News
  • The Scan
  • Classics™
    • 2MM+ Online Access
    • Paperback and Ebook
  • Rewinds
  • Partners
    • License Content
    • Submit Press Release
    • Advertise with Us
  • Account
    • Subscribe
    • Sign-in
    • My account
No Result
View All Result

© 2026 2 Minute Medicine, Inc. - Physician-written medical news.