Open Access
Open access
Scientific Reports, volume 13, issue 1, publication number 4171

Medical image captioning via generative pretrained transformers

Selivanov Alexander 1, 2
Rogov Oleg Y 1
Chesakov Daniil 1, 3
Shelmanov Artem 1, 3
Fedulova Irina 2
Dylov Dmitry V. 1
Publication typeJournal Article
Publication date2023-03-13
Quartile SCImago
Q1
Quartile WOS
Q2
Impact factor4.6
ISSN20452322
Multidisciplinary
Abstract

The proposed model for automatic clinical image caption generation combines the analysis of radiological scans with structured patient information from the textual records. It uses two language models, the Show-Attend-Tell and the GPT-3, to generate comprehensive and descriptive radiology records. The generated textual summary contains essential information about pathologies found, their location, along with the 2D heatmaps that localize each pathology on the scans. The model has been tested on two medical datasets, the Open-I, MIMIC-CXR, and the general-purpose MS-COCO, and the results measured with natural language assessment metrics demonstrated its efficient applicability to chest X-ray image captioning.

Citations by journals

1
2
Applied Sciences (Switzerland)
Applied Sciences (Switzerland), 2, 15.38%
Applied Sciences (Switzerland)
2 publications, 15.38%
European Radiology
European Radiology, 1, 7.69%
European Radiology
1 publication, 7.69%
JAMA network open
JAMA network open, 1, 7.69%
JAMA network open
1 publication, 7.69%
ACM Computing Surveys
ACM Computing Surveys, 1, 7.69%
ACM Computing Surveys
1 publication, 7.69%
Engineering Reports
Engineering Reports, 1, 7.69%
Engineering Reports
1 publication, 7.69%
Library Hi Tech
Library Hi Tech, 1, 7.69%
Library Hi Tech
1 publication, 7.69%
Frontiers in Neurorobotics
Frontiers in Neurorobotics, 1, 7.69%
Frontiers in Neurorobotics
1 publication, 7.69%
BMC Medical Informatics and Decision Making
BMC Medical Informatics and Decision Making, 1, 7.69%
BMC Medical Informatics and Decision Making
1 publication, 7.69%
Multimedia Tools and Applications
Multimedia Tools and Applications, 1, 7.69%
Multimedia Tools and Applications
1 publication, 7.69%
Frontiers in Pharmacology
Frontiers in Pharmacology, 1, 7.69%
Frontiers in Pharmacology
1 publication, 7.69%
IEEE Access
IEEE Access, 1, 7.69%
IEEE Access
1 publication, 7.69%
1
2

Citations by publishers

1
2
3
Springer Nature
Springer Nature, 3, 23.08%
Springer Nature
3 publications, 23.08%
Multidisciplinary Digital Publishing Institute (MDPI)
Multidisciplinary Digital Publishing Institute (MDPI), 2, 15.38%
Multidisciplinary Digital Publishing Institute (MDPI)
2 publications, 15.38%
Frontiers Media S.A.
Frontiers Media S.A., 2, 15.38%
Frontiers Media S.A.
2 publications, 15.38%
IEEE
IEEE, 2, 15.38%
IEEE
2 publications, 15.38%
American Medical Association (AMA)
American Medical Association (AMA), 1, 7.69%
American Medical Association (AMA)
1 publication, 7.69%
Association for Computing Machinery (ACM)
Association for Computing Machinery (ACM), 1, 7.69%
Association for Computing Machinery (ACM)
1 publication, 7.69%
Wiley
Wiley, 1, 7.69%
Wiley
1 publication, 7.69%
Emerald
Emerald, 1, 7.69%
Emerald
1 publication, 7.69%
1
2
3
  • We do not take into account publications that without a DOI.
  • Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
  • Statistics recalculated weekly.
Metrics
Share
Cite this
GOST |
Cite this
GOST Copy
Selivanov A. et al. Medical image captioning via generative pretrained transformers // Scientific Reports. 2023. Vol. 13. No. 1. 4171
GOST all authors (up to 50) Copy
Selivanov A., Rogov O. Y., Chesakov D., Shelmanov A., Fedulova I., Dylov D. V. Medical image captioning via generative pretrained transformers // Scientific Reports. 2023. Vol. 13. No. 1. 4171
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1038/s41598-023-31223-5
UR - https://doi.org/10.1038%2Fs41598-023-31223-5
TI - Medical image captioning via generative pretrained transformers
T2 - Scientific Reports
AU - Selivanov, Alexander
AU - Rogov, Oleg Y
AU - Chesakov, Daniil
AU - Shelmanov, Artem
AU - Fedulova, Irina
AU - Dylov, Dmitry V.
PY - 2023
DA - 2023/03/13 00:00:00
PB - Springer Nature
IS - 1
VL - 13
SN - 2045-2322
ER -
BibTex
Cite this
BibTex Copy
@article{2023_Selivanov,
author = {Alexander Selivanov and Oleg Y Rogov and Daniil Chesakov and Artem Shelmanov and Irina Fedulova and Dmitry V. Dylov},
title = {Medical image captioning via generative pretrained transformers},
journal = {Scientific Reports},
year = {2023},
volume = {13},
publisher = {Springer Nature},
month = {mar},
url = {https://doi.org/10.1038%2Fs41598-023-31223-5},
number = {1},
doi = {10.1038/s41598-023-31223-5}
}
Found error?