Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Generative pre-trained transformer 4o (GPT-4o) in solving text-based multiple response questions for European Diploma in Radiology (EDiR): a comparative study with radiologists
by
Burgetova, Andrea
, Sureyya, Ozbek Suha
, Lambert, Lukas
, Junquero, Vanesa
, Kyncl, Martin
, Oleaga, Laura
, Pristoupil, Jakub
, Merino, Cristina
in
Accuracy
/ Artificial intelligence
/ Candidates
/ Chatbots
/ Comparative studies
/ Diagnostic Radiology
/ Diplomas
/ Examination
/ Imaging
/ Internal Medicine
/ Interventional Radiology
/ Medicine
/ Medicine & Public Health
/ Natural language processing
/ Neuroradiology
/ Original
/ Original Article
/ Questions
/ Radiology
/ Reliability
/ Ultrasound
2025
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Generative pre-trained transformer 4o (GPT-4o) in solving text-based multiple response questions for European Diploma in Radiology (EDiR): a comparative study with radiologists
by
Burgetova, Andrea
, Sureyya, Ozbek Suha
, Lambert, Lukas
, Junquero, Vanesa
, Kyncl, Martin
, Oleaga, Laura
, Pristoupil, Jakub
, Merino, Cristina
in
Accuracy
/ Artificial intelligence
/ Candidates
/ Chatbots
/ Comparative studies
/ Diagnostic Radiology
/ Diplomas
/ Examination
/ Imaging
/ Internal Medicine
/ Interventional Radiology
/ Medicine
/ Medicine & Public Health
/ Natural language processing
/ Neuroradiology
/ Original
/ Original Article
/ Questions
/ Radiology
/ Reliability
/ Ultrasound
2025
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Generative pre-trained transformer 4o (GPT-4o) in solving text-based multiple response questions for European Diploma in Radiology (EDiR): a comparative study with radiologists
by
Burgetova, Andrea
, Sureyya, Ozbek Suha
, Lambert, Lukas
, Junquero, Vanesa
, Kyncl, Martin
, Oleaga, Laura
, Pristoupil, Jakub
, Merino, Cristina
in
Accuracy
/ Artificial intelligence
/ Candidates
/ Chatbots
/ Comparative studies
/ Diagnostic Radiology
/ Diplomas
/ Examination
/ Imaging
/ Internal Medicine
/ Interventional Radiology
/ Medicine
/ Medicine & Public Health
/ Natural language processing
/ Neuroradiology
/ Original
/ Original Article
/ Questions
/ Radiology
/ Reliability
/ Ultrasound
2025
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Generative pre-trained transformer 4o (GPT-4o) in solving text-based multiple response questions for European Diploma in Radiology (EDiR): a comparative study with radiologists
Journal Article
Generative pre-trained transformer 4o (GPT-4o) in solving text-based multiple response questions for European Diploma in Radiology (EDiR): a comparative study with radiologists
2025
Request Book From Autostore
and Choose the Collection Method
Overview
Objectives
This study aims to assess the accuracy of generative pre-trained transformer 4o (GPT-4o) in answering multiple response questions from the European Diploma in Radiology (EDiR) examination, comparing its performance to that of human candidates.
Materials and methods
Results from 42 EDiR candidates across Europe were compared to those from 26 fourth-year medical students who answered exclusively using the ChatGPT-4o in a prospective study (October 2024). The challenge consisted of 52 recall or understanding-based EDiR multiple-response questions, all without visual inputs.
Results
The GPT-4o achieved a mean score of 82.1 ± 3.0%, significantly outperforming the EDiR candidates with 49.4 ± 10.5% (
p
< 0.0001). In particular, chatGPT-4o demonstrated higher true positive rates while maintaining lower false positive rates compared to EDiR candidates, with a higher accuracy rate in all radiology subspecialties (
p
< 0.0001) except informatics (
p
= 0.20). There was near-perfect agreement between GPT-4 responses (κ = 0.872) and moderate agreement among EDiR participants (κ = 0.334). Exit surveys revealed that all participants used the copy-and-paste feature, and 73% submitted additional questions to clarify responses.
Conclusions
GPT-4o significantly outperformed human candidates in low-order, text-based EDiR multiple-response questions, demonstrating higher accuracy and reliability. These results highlight GPT-4o’s potential in answering text-based radiology questions. Further research is necessary to investigate its performance across different question formats and candidate populations to ensure broader applicability and reliability.
Critical relevance statement
GPT-4o significantly outperforms human candidates in factual radiology text-based questions in the EDiR, excelling especially in identifying correct responses, with a higher accuracy rate compared to radiologists.
Key Points
In EDiR text-based questions, ChatGPT-4o scored higher (82%) than EDiR participants (49%).
Compared to radiologists, GPT-4o excelled in identifying correct responses.
GPT-4o responses demonstrated higher agreement (κ = 0.87) compared to EDiR candidates (κ = 0.33).
Graphical Abstract
This website uses cookies to ensure you get the best experience on our website.