Predicting antibody binders and generating synthetic antibodies using deep learning

The antibody drug field has continually sought improvements to methods for candidate discovery and engineering. Historically, most such methods have been laboratory-based, but informatics methods have recently started to make an impact. Deep learning, a subfield of machine learning, is rapidly gaini...

Full description

Saved in:

Bibliographic Details
Published in:	mAbs Vol. 14; no. 1; p. 2069075
Main Authors:	Lim, Yoong Wearn, Adler, Adam S, Johnson, David S
Format:	Journal Article
Language:	English
Published:	United States Taylor & Francis 31-12-2022 Taylor & Francis Group
Subjects:	Antibodies Antibody repertoires Complementarity Determining Regions - chemistry convolutional neural networks CTLA-4 Antigen Deep Learning deep sequencing generative adversarial networks machine learning Programmed Cell Death 1 Receptor generative adversarial networks deep learning Antibody repertoires convolutional neural networks machine learning deep sequencing
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	The antibody drug field has continually sought improvements to methods for candidate discovery and engineering. Historically, most such methods have been laboratory-based, but informatics methods have recently started to make an impact. Deep learning, a subfield of machine learning, is rapidly gaining prominence in the biomedical research. Recent advances in microfluidics technologies and next-generation sequencing have not only revolutionized therapeutic antibody discovery, but also contributed to a vast amount of antibody repertoire sequencing data, providing opportunities for deep learning-based applications. Previously, we used microfluidics, yeast display, and deep sequencing to generate a panel of binder and non-binder antibody sequences to the cancer immunotherapy targets PD-1 and CTLA-4. Here we encoded the antibody light and heavy chain complementarity-determining regions (CDR3s) into antibody images, then built and trained convolutional neural network models to classify binders and non-binders. To improve model interpretability, we performed mutagenesis to identify CDR3 residues that were important for binder classification. We further built generative deep learning models using generative adversarial network models to produce synthetic antibodies against PD-1 and CTLA-4. Our models generated variable length CDR3 sequences that resemble real sequences. Overall, our study demonstrates that deep learning methods can be leveraged to mine and learn patterns in antibody sequences, offering insights into antibody engineering, optimization, and discovery.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1942-0862 1942-0870 1942-0870
DOI:	10.1080/19420862.2022.2069075