Journal of Surgical Oncology, volume 131, issue 8, pages 1562-1570

Assessment of Challenging Oncologic Cases: A Comparative Analysis Between ChatGPT, Gemini, and a Multidisciplinary Tumor Board

Luis A. Hernández-Flores ¹

José B. López‐Martínez ¹

Jesús J. Rosales‐de‐la‐Rosa ¹

Daniel Aillaud De Uriarte ²,³

S. Contreras ¹

Rubén Cortés-González ¹

Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico

Harvard Medical School, Center for Bioethics, Boston, Massachusetts, USA

The University of Texas Medical Branch (UTMB), Galveston, Texas, USA

Publication type: Journal Article

Publication date: 2025-02-12

Wiley

scimago Q1

wos Q2

SJR: 0.741

CiteScore: 4.2

Impact factor: 1.9

ISSN: 00224790, 10969098, 26743000

DOI: 10.1002/jso.28121

Abstract

Introduction

Since its introduction in 2022, public‐access conversational AI, exemplified by ChatGPT and Gemini, has been increasingly utilized in medical decision‐making, though its impact is questionable. This study aims to evaluate its efficacy in assessing complex oncologic cases compared to a multidisciplinary tumor board (MTB) comprising experts from various specialties.

Methods

A 2‐year retrospective analysis was conducted on 98 oncological cases at a reference medical center in Mexico City. An MTB comprising surgical oncologists, medical oncologists, radio‐oncologists, and pathologists, among others, reviewed and discussed each case to determine management strategies. We evaluated four key decision points, each dichotomized as affirmative or negative: the need for new imaging studies, radiation therapy, chemotherapy, and surgery. Comprehensive medical documentation accompanied each case. We then compared the AI models' decisions with those of the MTB using the same criteria and used Cohen's kappa to assess agreement.
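The agreement statistic named above can be illustrated with a minimal sketch. It assumes each decision point is coded as 1 (affirmative) or 0 (negative) for every case; the function name `cohens_kappa` and the ten example decisions are hypothetical and are not data from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same cases."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of cases")
    n = len(rater_a)
    # Observed agreement: share of cases where both raters gave the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    labels = set(count_a) | set(count_b)
    expected = sum((count_a[l] / n) * (count_b[l] / n) for l in labels)
    if expected == 1.0:
        # Both raters used one identical label throughout; kappa is undefined.
        return float("nan")
    return (observed - expected) / (1 - expected)


# Hypothetical example: "surgery yes/no" for ten cases (not study data).
mtb_decisions = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
ai_decisions = [1, 1, 0, 0, 0, 1, 1, 0, 1, 1]
print(round(cohens_kappa(mtb_decisions, ai_decisions), 3))
```

In this made-up example the raters agree on 7 of 10 cases, but half of that agreement is expected by chance, so kappa is about 0.4 rather than 0.7. This is why kappa is used here instead of raw percent agreement.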

Results

Agreement between the AI models, ChatGPT (4o) and Gemini (1.5 Flash), and the MTB ranged from none to slight for additional imaging studies (Gemini: κ = 0.100, p = 0.087; ChatGPT 4o: κ = 0.024, p = 0.592) and chemotherapy (Gemini: κ = 0.089, p = 0.316; ChatGPT 4o: κ = 0.336, p = 0.001). Moderate agreement was observed for decisions regarding surgery (Gemini: κ = 0.194, p = 0.046; ChatGPT 4o: κ = 0.467, p < 0.001) and radiotherapy (Gemini: κ = 0.214, p = 0.012; ChatGPT 4o: κ = 0.525, p < 0.001).
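To read the reported κ values side by side, the sketch below maps each one to a descriptive band. The abstract does not say which interpretation scale the authors used; the cut-offs here are an assumption, taken from the commonly cited convention (≤ 0 no agreement, 0.01-0.20 none to slight, 0.21-0.40 fair, 0.41-0.60 moderate, 0.61-0.80 substantial, 0.81-1.00 almost perfect). The function name `agreement_band` is hypothetical.

```python
def agreement_band(kappa):
    """Map a kappa value to a descriptive band (assumed cut-offs, see lead-in)."""
    if kappa <= 0:
        return "no agreement"
    if kappa <= 0.20:
        return "none to slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Kappa values as reported in the abstract, keyed by (decision point, model).
reported = {
    ("imaging", "Gemini"): 0.100,
    ("imaging", "ChatGPT 4o"): 0.024,
    ("chemotherapy", "Gemini"): 0.089,
    ("chemotherapy", "ChatGPT 4o"): 0.336,
    ("surgery", "Gemini"): 0.194,
    ("surgery", "ChatGPT 4o"): 0.467,
    ("radiotherapy", "Gemini"): 0.214,
    ("radiotherapy", "ChatGPT 4o"): 0.525,
}

bands = {key: agreement_band(value) for key, value in reported.items()}
for (decision, model), band in bands.items():
    print(f"{decision:13s} {model:11s} kappa={reported[(decision, model)]:.3f}  {band}")
```

Under these assumed cut-offs, only the ChatGPT 4o values for surgery and radiotherapy fall in the moderate band. The Gemini values for those two decisions fall in lower bands, so the labels in the abstract may rest on a different scale or on the paper's full text.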

Conclusions

Both ChatGPT and Gemini showed moderate agreement with the multidisciplinary tumor board on decisions regarding surgery and radiotherapy. ChatGPT also showed moderate agreement in chemotherapy, but further assessment is needed for other interventions. ChatGPT agreed with the MTB more closely than Gemini at most decision points. The potential of these public‐access AI tools in oncology warrants continued exploration to refine their utility in clinical practice.

Found

Top-30

Journals

Journal of Surgical Oncology: 1 publication, 12.5%
Melanoma Research: 1 publication, 12.5%
Journal of Cancer Research and Clinical Oncology: 1 publication, 12.5%
Diagnostics: 1 publication, 12.5%
Scientific Reports: 1 publication, 12.5%
Journal of Medical Systems: 1 publication, 12.5%
European urology oncology: 1 publication, 12.5%
Vascular Medicine: 1 publication, 12.5%

Publishers

Springer Nature: 3 publications, 37.5%
Wiley: 1 publication, 12.5%
Ovid Technologies (Wolters Kluwer Health): 1 publication, 12.5%
MDPI: 1 publication, 12.5%
Elsevier: 1 publication, 12.5%
SAGE: 1 publication, 12.5%

We do not take into account publications without a DOI.
Statistics recalculated weekly.

Cite this

GOST

Hernández-Flores L. A. et al. Assessment of Challenging Oncologic Cases: A Comparative Analysis Between ChatGPT, Gemini, and a Multidisciplinary Tumor Board // Journal of Surgical Oncology. 2025. Vol. 131. No. 8. pp. 1562-1570.

GOST, all authors (up to 50)

Hernández-Flores L. A., López‐Martínez J. B., Rosales‐de‐la‐Rosa J. J., Aillaud De Uriarte D., Contreras S., Cortés-González R. Assessment of Challenging Oncologic Cases: A Comparative Analysis Between ChatGPT, Gemini, and a Multidisciplinary Tumor Board // Journal of Surgical Oncology. 2025. Vol. 131. No. 8. pp. 1562-1570.

RIS

TY - JOUR
DO - 10.1002/jso.28121
UR - https://onlinelibrary.wiley.com/doi/10.1002/jso.28121
TI - Assessment of Challenging Oncologic Cases: A Comparative Analysis Between ChatGPT, Gemini, and a Multidisciplinary Tumor Board
T2 - Journal of Surgical Oncology
AU - Hernández-Flores, Luis A.
AU - López‐Martínez, José B.
AU - Rosales‐de‐la‐Rosa, Jesús J.
AU - Aillaud De Uriarte, Daniel
AU - Contreras, S.
AU - Cortés-González, Rubén
PY - 2025
DA - 2025/02/12
PB - Wiley
SP - 1562
EP - 1570
IS - 8
VL - 131
SN - 0022-4790
SN - 1096-9098
SN - 2674-3000
ER -

BibTeX (up to 50 authors)

@article{2025_Hernández-Flores,
  author = {Luis A. Hernández-Flores and José B. López‐Martínez and Jesús J. Rosales‐de‐la‐Rosa and Daniel Aillaud De Uriarte and S. Contreras and Rubén Cortés-González},
  title = {Assessment of Challenging Oncologic Cases: A Comparative Analysis Between ChatGPT, Gemini, and a Multidisciplinary Tumor Board},
  journal = {Journal of Surgical Oncology},
  year = {2025},
  volume = {131},
  publisher = {Wiley},
  month = {feb},
  url = {https://onlinelibrary.wiley.com/doi/10.1002/jso.28121},
  number = {8},
  pages = {1562--1570},
  doi = {10.1002/jso.28121}
}

MLA

Hernández-Flores, Luis A., et al. “Assessment of Challenging Oncologic Cases: A Comparative Analysis Between ChatGPT, Gemini, and a Multidisciplinary Tumor Board.” Journal of Surgical Oncology, vol. 131, no. 8, Feb. 2025, pp. 1562-1570. https://onlinelibrary.wiley.com/doi/10.1002/jso.28121.

ISSN

00224790 (Print)

10969098 (Electronic)

26743000 (Print, Electronic)