Evolutionary-scale prediction of atomic-level protein structure with a language model
Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic protein sequences, including >225 million that are predicted with high confidence, which gives a view into the vast breadth and diversity of natural proteins.
Top-30 Journals
Bioinformatics: 42 publications, 3.16%
Briefings in Bioinformatics: 41 publications, 3.09%
Nature Communications: 36 publications, 2.71%
Journal of Chemical Information and Modeling: 27 publications, 2.03%
Proceedings of the National Academy of Sciences of the United States of America: 18 publications, 1.36%
Computational and Structural Biotechnology Journal: 17 publications, 1.28%
PLoS Computational Biology: 16 publications, 1.2%
Protein Science: 16 publications, 1.2%
Nucleic Acids Research: 16 publications, 1.2%
International Journal of Molecular Sciences: 15 publications, 1.13%
Current Opinion in Structural Biology: 15 publications, 1.13%
Lecture Notes in Computer Science: 15 publications, 1.13%
Proteins: Structure, Function and Genetics: 14 publications, 1.05%
Nature: 14 publications, 1.05%
Nature Machine Intelligence: 12 publications, 0.9%
International Journal of Biological Macromolecules: 12 publications, 0.9%
Nature Methods: 11 publications, 0.83%
Nature Biotechnology: 10 publications, 0.75%
Science: 9 publications, 0.68%
Cell: 9 publications, 0.68%
Scientific Reports: 9 publications, 0.68%
BMC Bioinformatics: 8 publications, 0.6%
Communications Biology: 7 publications, 0.53%
Molecules: 7 publications, 0.53%
Computers in Biology and Medicine: 6 publications, 0.45%
Journal of Cheminformatics: 6 publications, 0.45%
Cold Spring Harbor Perspectives in Biology: 6 publications, 0.45%
Cell Systems: 6 publications, 0.45%
Advanced Science: 6 publications, 0.45%
Publishers
Cold Spring Harbor Laboratory: 359 publications, 27.03%
Springer Nature: 197 publications, 14.83%
Elsevier: 171 publications, 12.88%
Oxford University Press: 121 publications, 9.11%
Wiley: 72 publications, 5.42%
American Chemical Society (ACS): 57 publications, 4.29%
MDPI: 56 publications, 4.22%
Public Library of Science (PLoS): 22 publications, 1.66%
Frontiers Media S.A.: 21 publications, 1.58%
Proceedings of the National Academy of Sciences (PNAS): 18 publications, 1.36%
Institute of Electrical and Electronics Engineers (IEEE): 17 publications, 1.28%
American Association for the Advancement of Science (AAAS): 13 publications, 0.98%
Taylor & Francis: 10 publications, 0.75%
Royal Society of Chemistry (RSC): 9 publications, 0.68%
Research Square Platform LLC: 7 publications, 0.53%
International Union of Crystallography (IUCr): 6 publications, 0.45%
American Society for Microbiology: 6 publications, 0.45%
eLife Sciences Publications: 6 publications, 0.45%
Walter de Gruyter: 4 publications, 0.3%
Annual Reviews: 3 publications, 0.23%
AIP Publishing: 3 publications, 0.23%
Mary Ann Liebert: 2 publications, 0.15%
The American Association of Immunologists: 2 publications, 0.15%
Association for Computing Machinery (ACM): 2 publications, 0.15%
PeerJ: 2 publications, 0.15%
European Molecular Biology Organization: 1 publication, 0.08%
Massachusetts Medical Society: 1 publication, 0.08%
1 publication, 0.08%
Portland Press: 1 publication, 0.08%
- We do not take into account publications without a DOI.
- Statistics are recalculated only for publications connected to researchers, organizations, and labs registered on the platform.
- Statistics are recalculated weekly.