Open Access
Open access
Science, volume 379, issue 6637, pages 1123-1130

Evolutionary-scale prediction of atomic-level protein structure with a language model

Zeming Lin 1, 2
Halil Akin 1
Roshan Rao 1
Brian Hie 1, 3
Zhongkai Zhu 1
Wenting Lu 1
Nikita Smetanin 1
Robert Verkuil 1
Ori Kabeli 1
Yaniv Shmueli 1
Allan dos Santos Costa 4
Maryam Fazel-Zarandi 1
Tom Sercu 1
S Candido 1
Alexander Rives 1, 2
Show full list: 15 authors
Publication typeJournal Article
Publication date2023-03-17
Journal: Science
scimago Q1
SJR11.902
CiteScore61.1
Impact factor44.7
ISSN00368075, 10959203
Multidisciplinary
Abstract

Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic protein sequences, including >225 million that are predicted with high confidence, which gives a view into the vast breadth and diversity of natural proteins.

Found 

Top-30

Journals

10
20
30
40
50
60
70
80
90
10
20
30
40
50
60
70
80
90

Publishers

100
200
300
400
500
600
100
200
300
400
500
600
  • We do not take into account publications without a DOI.
  • Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Share
Cite this
GOST | RIS | BibTex | MLA
Found error?
Profiles