Evolutionary-scale prediction of atomic-level protein structure with a language model
Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic protein sequences, including >225 million that are predicted with high confidence, which gives a view into the vast breadth and diversity of natural proteins.
Top-30
Journals
- Briefings in Bioinformatics: 106 publications, 3.24%
- Journal of Chemical Information and Modeling: 102 publications, 3.12%
- Bioinformatics: 97 publications, 2.96%
- Nature Communications: 96 publications, 2.93%
- bioRxiv: 84 publications, 2.57%
- Computational and Structural Biotechnology Journal: 42 publications, 1.28%
- Current Opinion in Structural Biology: 39 publications, 1.19%
- Protein Science: 39 publications, 1.19%
- PLoS Computational Biology: 35 publications, 1.07%
- Nucleic Acids Research: 35 publications, 1.07%
- Nature Machine Intelligence: 34 publications, 1.04%
- Proceedings of the National Academy of Sciences of the United States of America: 34 publications, 1.04%
- Nature Methods: 32 publications, 0.98%
- Proteins: Structure, Function and Genetics: 31 publications, 0.95%
- Scientific Reports: 31 publications, 0.95%
- Methods in Molecular Biology: 31 publications, 0.95%
- Lecture Notes in Computer Science: 29 publications, 0.89%
- International Journal of Molecular Sciences: 28 publications, 0.86%
- International Journal of Biological Macromolecules: 26 publications, 0.79%
- Cell Systems: 24 publications, 0.73%
- Journal of Molecular Biology: 23 publications, 0.7%
- Nature: 22 publications, 0.67%
- Advanced Science: 20 publications, 0.61%
- Computers in Biology and Medicine: 19 publications, 0.58%
- Science: 18 publications, 0.55%
- eLife: 18 publications, 0.55%
- Nature Biotechnology: 16 publications, 0.49%
- BMC Bioinformatics: 16 publications, 0.49%
- mAbs: 16 publications, 0.49%
Publishers
- Cold Spring Harbor Laboratory: 889 publications, 27.16%
- Springer Nature: 523 publications, 15.98%
- Elsevier: 504 publications, 15.4%
- Oxford University Press: 291 publications, 8.89%
- American Chemical Society (ACS): 190 publications, 5.81%
- Wiley: 174 publications, 5.32%
- MDPI: 105 publications, 3.21%
- Institute of Electrical and Electronics Engineers (IEEE): 101 publications, 3.09%
- Public Library of Science (PLoS): 51 publications, 1.56%
- Frontiers Media S.A.: 46 publications, 1.41%
- Proceedings of the National Academy of Sciences (PNAS): 34 publications, 1.04%
- American Association for the Advancement of Science (AAAS): 33 publications, 1.01%
- Taylor & Francis: 28 publications, 0.86%
- Royal Society of Chemistry (RSC): 26 publications, 0.79%
- eLife Sciences Publications: 18 publications, 0.55%
- Association for Computing Machinery (ACM): 17 publications, 0.52%
- American Society for Microbiology: 16 publications, 0.49%
- International Union of Crystallography (IUCr): 14 publications, 0.43%
- Annual Reviews: 9 publications, 0.27%
- American Physical Society (APS): 9 publications, 0.27%
- Science in China Press: 9 publications, 0.27%
- AIP Publishing: 7 publications, 0.21%
- Research Square Platform LLC: 7 publications, 0.21%
- IOP Publishing: 6 publications, 0.18%
- Walter de Gruyter: 5 publications, 0.15%
- PeerJ: 5 publications, 0.15%
- Mary Ann Liebert: 4 publications, 0.12%
- SAGE: 4 publications, 0.12%
- The Royal Society: 4 publications, 0.12%
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.