Asset Details

MbrlCatalogueTitleDetail

Do you wish to reserve the book?

Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS

by Madfa, Ahmed A. , Alshammari, Abdullah F. , Anazi, Bassam Ali

in 631/114 / 639/705 / 692/308 / 692/700 / Accuracy / Artificial Intelligence / Chatbots / Comparative studies / Decision making / Diagnostic accuracy / Digital pathology / Generative Artificial Intelligence / Histopathology / Humanities and Social Sciences / Humans / Kappa coefficient / Large Language Models / Mathematical models / multidisciplinary / Oral pathology / Pathology / Pathology, Oral - methods / Quality assurance / Science / Science (multidisciplinary) / Statistical analysis

2026

Yes Please

Hey, we have placed the reservation for you!

By the way, why not check out events that you can attend while you pick your title.

Oops! Something went wrong.

Looks like we were not able to place the reservation. Kindly try again later.

Are you sure you want to remove the book from the shelf?

Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS

by Madfa, Ahmed A. , Alshammari, Abdullah F. , Anazi, Bassam Ali

2026

Confirm

Do you wish to request the book?

Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS

by Madfa, Ahmed A. , Alshammari, Abdullah F. , Anazi, Bassam Ali

2026

Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy

How would you like to get it?

Submit

We have requested the book for you!

Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.

Oops! Something went wrong.

Looks like we were not able to place your request. Kindly try again later.

Journal Article

Human versus artificial intelligence in oral pathology diagnosis: a comparative study of ChatGPT, Grok, and MANUS

Madfa, Ahmed A.,

Alshammari, Abdullah F.,

Anazi, Bassam Ali

2026

Overview

Artificial intelligence (AI) integration in diagnostic medicine has advanced accuracy and efficiency, particularly in pathology. This study assessed the diagnostic performance of three large language models (LLMs)—ChatGPT (GPT-4-turbo), Grok (xAI), and MANUS—in interpreting histopathology slides of oral lesions. A comparative diagnostic study was conducted using 100 high-resolution slides representing diverse oral pathologies. Images were sourced from a validated textbook and reviewed by two board-certified oral pathologists who provided consensus diagnoses. Each slide was analysed twice by the three AI models using standardized prompts. Diagnostic accuracy, intra-model consistency, inter-model concordance, and agreement with human experts were evaluated using descriptive statistics, Cohen’s kappa, McNemar’s test, and chi-square analysis. All AI models demonstrated high diagnostic accuracy. In the second round, Grok achieved the highest accuracy (97%), followed by MANUS (96%) and ChatGPT (94%). ChatGPT showed the highest intra-model consistency (κ = 0.918), while MANUS and Grok displayed substantial agreement (κ = 0.790 and 0.740). Expert pathologists achieved 98% accuracy. Comparisons between AI models and human diagnoses showed moderate to substantial agreement, with MANUS most aligned with experts. Most misclassifications occurred in histologically ambiguous cases, with no significant differences between AI models. Multimodal LLMs demonstrated strong diagnostic capabilities, consistency, and alignment with expert reasoning in oral histopathology interpretation. Grok was the most accurate, ChatGPT the most consistent, and MANUS the most expert-aligned. These findings support AI integration into digital pathology for diagnostic support, education, and quality assurance, with further validation in clinical datasets recommended.

Share this book

Add to My Shelf

Publisher

Nature Publishing Group UK,Nature Publishing Group,Nature Portfolio

Subject

631/114

/ 639/705

/ 692/308

/ 692/700

/ Accuracy

/ Artificial Intelligence

/ Chatbots