Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS
by
Madfa, Ahmed A.
, Alshammari, Abdullah F.
, Anazi, Bassam Ali
in
631/114
/ 639/705
/ 692/308
/ 692/700
/ Accuracy
/ Artificial Intelligence
/ Chatbots
/ Comparative studies
/ Decision making
/ Diagnostic accuracy
/ Digital pathology
/ Generative Artificial Intelligence
/ Histopathology
/ Humanities and Social Sciences
/ Humans
/ Kappa coefficient
/ Large language models
/ Mathematical models
/ multidisciplinary
/ Oral pathology
/ Pathology
/ Pathology, Oral - methods
/ Quality assurance
/ Science
/ Science (multidisciplinary)
/ Statistical analysis
2026
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS
by
Madfa, Ahmed A.
, Alshammari, Abdullah F.
, Anazi, Bassam Ali
in
631/114
/ 639/705
/ 692/308
/ 692/700
/ Accuracy
/ Artificial Intelligence
/ Chatbots
/ Comparative studies
/ Decision making
/ Diagnostic accuracy
/ Digital pathology
/ Generative Artificial Intelligence
/ Histopathology
/ Humanities and Social Sciences
/ Humans
/ Kappa coefficient
/ Large language models
/ Mathematical models
/ multidisciplinary
/ Oral pathology
/ Pathology
/ Pathology, Oral - methods
/ Quality assurance
/ Science
/ Science (multidisciplinary)
/ Statistical analysis
2026
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS
by
Madfa, Ahmed A.
, Alshammari, Abdullah F.
, Anazi, Bassam Ali
in
631/114
/ 639/705
/ 692/308
/ 692/700
/ Accuracy
/ Artificial Intelligence
/ Chatbots
/ Comparative studies
/ Decision making
/ Diagnostic accuracy
/ Digital pathology
/ Generative Artificial Intelligence
/ Histopathology
/ Humanities and Social Sciences
/ Humans
/ Kappa coefficient
/ Large language models
/ Mathematical models
/ multidisciplinary
/ Oral pathology
/ Pathology
/ Pathology, Oral - methods
/ Quality assurance
/ Science
/ Science (multidisciplinary)
/ Statistical analysis
2026
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS
Journal Article
Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS
2026
Request Book From Autostore
and Choose the Collection Method
Overview
Artificial intelligence (AI) integration in diagnostic medicine has advanced accuracy and efficiency, particularly in pathology. This study assessed the diagnostic performance of three large language models (LLMs)—ChatGPT (GPT-4-turbo), Grok (xAI), and MANUS—in interpreting histopathology slides of oral lesions. A comparative diagnostic study was conducted using 100 high-resolution slides representing diverse oral pathologies. Images were sourced from a validated textbook and reviewed by two board-certified oral pathologists who provided consensus diagnoses. Each slide was analysed twice by the three AI models using standardized prompts. Diagnostic accuracy, intra-model consistency, inter-model concordance, and agreement with human experts were evaluated using descriptive statistics, Cohen’s kappa, McNemar’s test, and chi-square analysis. All AI models demonstrated high diagnostic accuracy. In the second round, Grok achieved the highest accuracy (97%), followed by MANUS (96%) and ChatGPT (94%). ChatGPT showed the highest intra-model consistency (κ = 0.918), while MANUS and Grok displayed substantial agreement (κ = 0.790 and 0.740). Expert pathologists achieved 98% accuracy. Comparisons between AI models and human diagnoses showed moderate to substantial agreement, with MANUS most aligned with experts. Most misclassifications occurred in histologically ambiguous cases, with no significant differences between AI models. Multimodal LLMs demonstrated strong diagnostic capabilities, consistency, and alignment with expert reasoning in oral histopathology interpretation. Grok was the most accurate, ChatGPT the most consistent, and MANUS the most expert-aligned. These findings support AI integration into digital pathology for diagnostic support, education, and quality assurance, with further validation in clinical datasets recommended.
Publisher
Nature Publishing Group UK,Nature Publishing Group,Nature Portfolio
Subject
This website uses cookies to ensure you get the best experience on our website.