Open Access

medRxiv

Large language models for conducting systematic reviews: on the rise, but not yet ready for use - a scoping review

Тип публикации: Posted Content

Дата публикации: 2024-12-24

openRxiv

medRxiv

SJR: —

CiteScore: —

Impact factor: —

ISSN:

DOI: 10.1101/2024.12.19.24319326

Скопировать DOI

Краткое описание

ABSTRACT

Background

Machine learning (ML) promises versatile help in the creation of systematic reviews (SRs). Recently, further developments in the form of large language models (LLMs) and their application in SR conduct attracted attention.

Objective

To provide an overview of ML and specifically LLM applications in SR conduct in health research.

Study design

We systematically searched MEDLINE, Web of Science, IEEEXplore, ACM Digital Library, Europe PMC (preprints), Google Scholar, and conducted an additional hand search (last search: 26 February 2024). We included scientific articles in English or German, published from April 2021 onwards, building upon the results of a mapping review with a related research question. Two reviewers independently screened studies for eligibility; after piloting, one reviewer extracted data, checked by another.

Results

Our database search yielded 8054 hits, and we identified 33 articles from our hand search. Of the 196 included reports, 159 described more traditional ML techniques, 37 focused on LLMs. LLM approaches covered 10 of 13 defined SR steps, most frequently literature search (n=15, 41%), study selection (n=14, 38%), and data extraction (n=11, 30%). The mostly recurring LLM was GPT (n=33, 89%). Validation studies were predominant (n=21, 57%). In half of the studies, authors evaluated LLM use as promising (n=20, 54%), one quarter as neutral (n=9, 24%) and one fifth as non-promising (n=8, 22%).

Conclusions

Although LLMs show promise in supporting SR creation, fully established or validated applications are often lacking. The rapid increase in research on LLMs for evidence synthesis production highlights their growing relevance.

HIGHLIGHTS

Machine learning (ML) offers promising support for systematic review (SR) creation.

GPT was the most commonly used large language model (LLM) to support SR production.

LLM application included 10 of 13 defined SR steps, most often literature search.

Validation studies predominated, but fully established LLM applications are rare.

LLM research for SR conduct is surging, highlighting the increasing relevance.

Для доступа к списку цитирований публикации необходимо авторизоваться.

Войти с ORCID

Топ-30

Журналы

	1
Journal of Dentistry	Journal of Dentistry, 1, 100% Journal of Dentistry 1 публикация, 100%
	1

Издатели

	1
Elsevier	Elsevier, 1, 100% Elsevier 1 публикация, 100%
	1

Мы не учитываем публикации, у которых нет DOI.
Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.

Войти с ORCID

Метрики

Цитировать

ГОСТ |

Цитировать

ГОСТ Скопировать

Lieberum J. et al. Large language models for conducting systematic reviews: on the rise, but not yet ready for use - a scoping review // medRxiv. 2024.

ГОСТ со всеми авторами (до 50) Скопировать

Lieberum J., Toews M., Metzendorf M., Heilmeyer F. A., Siemens W., Haverkamp C., Böhringer D., Meerpohl J. J., Eisele A. Large language models for conducting systematic reviews: on the rise, but not yet ready for use - a scoping review // medRxiv. 2024.

RIS |

Цитировать

RIS Скопировать

TY - GENERIC

DO - 10.1101/2024.12.19.24319326

UR - http://medrxiv.org/lookup/doi/10.1101/2024.12.19.24319326

TI - Large language models for conducting systematic reviews: on the rise, but not yet ready for use - a scoping review

T2 - medRxiv

AU - Lieberum, Judith-Lisa

AU - Toews, Markus

AU - Metzendorf, Maria-Inti

AU - Heilmeyer, F. A.

AU - Siemens, Waldemar

AU - Haverkamp, Christian

AU - Böhringer, Daniel

AU - Meerpohl, Joerg J.

AU - Eisele, Angelika

PY - 2024

DA - 2024/12/24

PB - openRxiv

ER -

BibTex

Цитировать

BibTex (до 50 авторов) Скопировать

@article{2024_Lieberum,

author = {Judith-Lisa Lieberum and Markus Toews and Maria-Inti Metzendorf and F. A. Heilmeyer and Waldemar Siemens and Christian Haverkamp and Daniel Böhringer and Joerg J. Meerpohl and Angelika Eisele},

title = {Large language models for conducting systematic reviews: on the rise, but not yet ready for use - a scoping review},

journal = {medRxiv},

year = {2024},

publisher = {openRxiv},

month = {dec},

url = {http://medrxiv.org/lookup/doi/10.1101/2024.12.19.24319326},

doi = {10.1101/2024.12.19.24319326}

}

Издатель

openRxiv

Журнал

medRxiv

SJR

—

CiteScore

—

Impact factor

—

ISSN

Ошибка в публикации?