Open Access
Open access
Science, volume 379, issue 6637, pages 1123-1130

Evolutionary-scale prediction of atomic-level protein structure with a language model

Zeming Lin 1, 2
Halil Akin 1
Roshan Rao 1
Brian Hie 1, 3
Zhongkai Zhu 1
Wenting Lu 1
Nikita Smetanin 1
Robert Verkuil 1
Ori Kabeli 1
Yaniv Shmueli 1
Allan dos Santos Costa 4
Maryam Fazel-Zarandi 1
Tom Sercu 1
S Candido 1
Alexander Rives 1, 2
Publication typeJournal Article
Publication date2023-03-17
Journal: Science
Quartile SCImago
Q1
Quartile WOS
Q1
Impact factor56.9
ISSN00368075, 10959203
Multidisciplinary
Abstract

Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic protein sequences, including >225 million that are predicted with high confidence, which gives a view into the vast breadth and diversity of natural proteins.

Top-30

Journals

5
10
15
20
25
30
35
40
45
Bioinformatics
42 publications, 3.16%
Briefings in Bioinformatics
41 publications, 3.09%
Nature Communications
36 publications, 2.71%
Journal of Chemical Information and Modeling
27 publications, 2.03%
Proceedings of the National Academy of Sciences of the United States of America
18 publications, 1.36%
Computational and Structural Biotechnology Journal
17 publications, 1.28%
PLoS Computational Biology
16 publications, 1.2%
Protein Science
16 publications, 1.2%
Nucleic Acids Research
16 publications, 1.2%
International Journal of Molecular Sciences
15 publications, 1.13%
Current Opinion in Structural Biology
15 publications, 1.13%
Lecture Notes in Computer Science
15 publications, 1.13%
Proteins: Structure, Function and Genetics
14 publications, 1.05%
Nature
14 publications, 1.05%
Nature Machine Intelligence
12 publications, 0.9%
International Journal of Biological Macromolecules
12 publications, 0.9%
Nature Methods
11 publications, 0.83%
Nature Biotechnology
10 publications, 0.75%
Science
9 publications, 0.68%
Cell
9 publications, 0.68%
Scientific Reports
9 publications, 0.68%
BMC Bioinformatics
8 publications, 0.6%
Communications Biology
7 publications, 0.53%
Molecules
7 publications, 0.53%
Computers in Biology and Medicine
6 publications, 0.45%
Journal of Cheminformatics
6 publications, 0.45%
Cold Spring Harbor perspectives in biology
6 publications, 0.45%
Cell Systems
6 publications, 0.45%
Advanced Science
6 publications, 0.45%
5
10
15
20
25
30
35
40
45

Publishers

50
100
150
200
250
300
350
400
Cold Spring Harbor Laboratory
359 publications, 27.03%
Springer Nature
197 publications, 14.83%
Elsevier
171 publications, 12.88%
Oxford University Press
121 publications, 9.11%
Wiley
72 publications, 5.42%
American Chemical Society (ACS)
57 publications, 4.29%
MDPI
56 publications, 4.22%
Public Library of Science (PLoS)
22 publications, 1.66%
Frontiers Media S.A.
21 publications, 1.58%
Proceedings of the National Academy of Sciences (PNAS)
18 publications, 1.36%
Institute of Electrical and Electronics Engineers (IEEE)
17 publications, 1.28%
American Association for the Advancement of Science (AAAS)
13 publications, 0.98%
Taylor & Francis
10 publications, 0.75%
Royal Society of Chemistry (RSC)
9 publications, 0.68%
Research Square Platform LLC
7 publications, 0.53%
International Union of Crystallography (IUCr)
6 publications, 0.45%
American Society for Microbiology
6 publications, 0.45%
eLife Sciences Publications
6 publications, 0.45%
Walter de Gruyter
4 publications, 0.3%
Annual Reviews
3 publications, 0.23%
AIP Publishing
3 publications, 0.23%
Mary Ann Liebert
2 publications, 0.15%
The American Association of Immunologists
2 publications, 0.15%
Association for Computing Machinery (ACM)
2 publications, 0.15%
PeerJ
2 publications, 0.15%
European Molecular Biology Organization
1 publication, 0.08%
Massachusetts Medical Society
1 publication, 0.08%
1 publication, 0.08%
Portland Press
1 publication, 0.08%
50
100
150
200
250
300
350
400
  • We do not take into account publications without a DOI.
  • Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
Share
Cite this
GOST |
Cite this
GOST Copy
Lin Z. et al. Evolutionary-scale prediction of atomic-level protein structure with a language model // Science. 2023. Vol. 379. No. 6637. pp. 1123-1130.
GOST all authors (up to 50) Copy
Lin Z., Akin H., Rao R., Hie B., Zhu Z., Lu W., Smetanin N., Verkuil R., Kabeli O., Shmueli Y., dos Santos Costa A., Fazel-Zarandi M., Sercu T., Candido S., Rives A. Evolutionary-scale prediction of atomic-level protein structure with a language model // Science. 2023. Vol. 379. No. 6637. pp. 1123-1130.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1126/science.ade2574
UR - https://doi.org/10.1126/science.ade2574
TI - Evolutionary-scale prediction of atomic-level protein structure with a language model
T2 - Science
AU - Lin, Zeming
AU - Akin, Halil
AU - Rao, Roshan
AU - Hie, Brian
AU - Zhu, Zhongkai
AU - Lu, Wenting
AU - Smetanin, Nikita
AU - Verkuil, Robert
AU - Kabeli, Ori
AU - Shmueli, Yaniv
AU - dos Santos Costa, Allan
AU - Fazel-Zarandi, Maryam
AU - Sercu, Tom
AU - Candido, S
AU - Rives, Alexander
PY - 2023
DA - 2023/03/17
PB - American Association for the Advancement of Science (AAAS)
SP - 1123-1130
IS - 6637
VL - 379
SN - 0036-8075
SN - 1095-9203
ER -
BibTex |
Cite this
BibTex Copy
@article{2023_Lin,
author = {Zeming Lin and Halil Akin and Roshan Rao and Brian Hie and Zhongkai Zhu and Wenting Lu and Nikita Smetanin and Robert Verkuil and Ori Kabeli and Yaniv Shmueli and Allan dos Santos Costa and Maryam Fazel-Zarandi and Tom Sercu and S Candido and Alexander Rives},
title = {Evolutionary-scale prediction of atomic-level protein structure with a language model},
journal = {Science},
year = {2023},
volume = {379},
publisher = {American Association for the Advancement of Science (AAAS)},
month = {mar},
url = {https://doi.org/10.1126/science.ade2574},
number = {6637},
pages = {1123--1130},
doi = {10.1126/science.ade2574}
}
MLA
Cite this
MLA Copy
Lin, Zeming, et al. “Evolutionary-scale prediction of atomic-level protein structure with a language model.” Science, vol. 379, no. 6637, Mar. 2023, pp. 1123-1130. https://doi.org/10.1126/science.ade2574.
Found error?
Profiles