De novo protein design by citizen scientists

Online citizen science projects such as GalaxyZoo 1 , Eyewire 2 and Phylo 3 have proven very successful for data collection, annotation and processing, but for the most part have harnessed human pattern-recognition skills rather than human creativity. An exception is the game EteRNA 4 , in which gam...

Full description

Saved in:
Bibliographic Details
Published in:Nature (London) Vol. 570; no. 7761; pp. 390 - 394
Main Authors: Koepnick, Brian, Flatten, Jeff, Husain, Tamir, Ford, Alex, Silva, Daniel-Adriano, Bick, Matthew J., Bauer, Aaron, Liu, Gaohua, Ishida, Yojiro, Boykov, Alexander, Estep, Roger D., Kleinfelter, Susan, Nørgård-Solano, Toke, Wei, Linda, Players, Foldit, Montelione, Gaetano T., DiMaio, Frank, Popović, Zoran, Khatib, Firas, Cooper, Seth, Baker, David
Format: Journal Article
Language:English
Published: London Nature Publishing Group UK 01-06-2019
Nature Publishing Group
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Online citizen science projects such as GalaxyZoo 1 , Eyewire 2 and Phylo 3 have proven very successful for data collection, annotation and processing, but for the most part have harnessed human pattern-recognition skills rather than human creativity. An exception is the game EteRNA 4 , in which game players learn to build new RNA structures by exploring the discrete two-dimensional space of Watson–Crick base pairing possibilities. Building new proteins, however, is a more challenging task to present in a game, as both the representation and evaluation of a protein structure are intrinsically three-dimensional. We posed the challenge of de novo protein design in the online protein-folding game Foldit 5 . Players were presented with a fully extended peptide chain and challenged to craft a folded protein structure and an amino acid sequence encoding that structure. After many iterations of player design, analysis of the top-scoring solutions and subsequent game improvement, Foldit players can now—starting from an extended polypeptide chain—generate a diversity of protein structures and sequences that encode them in silico. One hundred forty-six Foldit player designs with sequences unrelated to naturally occurring proteins were encoded in synthetic genes; 56 were found to be expressed and soluble in Escherichia coli , and to adopt stable monomeric folded structures in solution. The diversity of these structures is unprecedented in de novo protein design, representing 20 different folds—including a new fold not observed in natural proteins. High-resolution structures were determined for four of the designs, and are nearly identical to the player models. This work makes explicit the considerable implicit knowledge that contributes to success in de novo protein design, and shows that citizen scientists can discover creative new solutions to outstanding scientific challenges such as the protein design problem. Proteins designed de novo by players of the online protein-folding game Foldit can be expressed in Escherichia coli and adopt the designed structure in solution.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
B.K., Z.P., F.K., S.C., and D.B. designed the study. B.K., J.F., T.H., A.F., D.A.S., and S.C. developed Foldit software tools. A. Boykov, R.D.E., S.K., T.N.S., L.W., and Foldit Players designed all proteins. B.K., F.K., A.F., and A. Bauer analyzed Foldit player designs. B.K. performed biophysical characterization. B.K., M.J.B., and F.D. determined crystal structures. G.L., Y.I., and G.T.M. determined the NMR structure. B.K. and D.B. wrote the manuscript with input from all authors. Foldit players contributed extensively through their feedback and gameplay, which generated the data for this paper.
Author contributions
ISSN:0028-0836
1476-4687
DOI:10.1038/s41586-019-1274-4