Exploring Latent Space for Generating Peptide Analogs Using Protein Language Models

Generating peptides with desired properties is crucial for drug discovery and biotechnology. Traditional sequence-based and structure-based methods often require extensive datasets, which limits their effectiveness. In this study, we proposed a novel method that utilized autoencoder shaped models to...

Full description

Saved in:
Bibliographic Details
Main Authors: Liang, Po-Yu, Huang, Xueting, Duran, Tibo, Wiemer, Andrew J, Bai, Jun
Format: Journal Article
Language:English
Published: 15-08-2024
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Generating peptides with desired properties is crucial for drug discovery and biotechnology. Traditional sequence-based and structure-based methods often require extensive datasets, which limits their effectiveness. In this study, we proposed a novel method that utilized autoencoder shaped models to explore the protein embedding space, and generate novel peptide analogs by leveraging protein language models. The proposed method requires only a single sequence of interest, avoiding the need for large datasets. Our results show significant improvements over baseline models in similarity indicators of peptide structures, descriptors and bioactivities. The proposed method validated through Molecular Dynamics simulations on TIGIT inhibitors, demonstrates that our method produces peptide analogs with similar yet distinct properties, highlighting its potential to enhance peptide screening processes.
AbstractList Generating peptides with desired properties is crucial for drug discovery and biotechnology. Traditional sequence-based and structure-based methods often require extensive datasets, which limits their effectiveness. In this study, we proposed a novel method that utilized autoencoder shaped models to explore the protein embedding space, and generate novel peptide analogs by leveraging protein language models. The proposed method requires only a single sequence of interest, avoiding the need for large datasets. Our results show significant improvements over baseline models in similarity indicators of peptide structures, descriptors and bioactivities. The proposed method validated through Molecular Dynamics simulations on TIGIT inhibitors, demonstrates that our method produces peptide analogs with similar yet distinct properties, highlighting its potential to enhance peptide screening processes.
Author Bai, Jun
Huang, Xueting
Wiemer, Andrew J
Duran, Tibo
Liang, Po-Yu
Author_xml – sequence: 1
  givenname: Po-Yu
  surname: Liang
  fullname: Liang, Po-Yu
– sequence: 2
  givenname: Xueting
  surname: Huang
  fullname: Huang, Xueting
– sequence: 3
  givenname: Tibo
  surname: Duran
  fullname: Duran, Tibo
– sequence: 4
  givenname: Andrew J
  surname: Wiemer
  fullname: Wiemer, Andrew J
– sequence: 5
  givenname: Jun
  surname: Bai
  fullname: Bai, Jun
BackLink https://doi.org/10.48550/arXiv.2408.08341$$DView paper in arXiv
BookMark eNqFjr0KwjAUhTPo4N8DOJkXsKa2ha4iVQcFoXUuF3sbAvEmJFHq22uLu9OB88P5pmxEhpCxZSyiNM8ysQHXqVe0TUUeiTxJ4wkri85q4xRJfoaAFHhp4Y68NY4fkdBB6LMr2qAa5DsCbaTnNz-4zgRU9F2SfIJEfjENaj9n4xa0x8VPZ2x1KKr9aT3c19apB7h33WPUA0byv_EBC1s_NQ
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by/4.0
DBID AKY
ALC
GOX
DOI 10.48550/arxiv.2408.08341
DatabaseName arXiv Computer Science
arXiv Quantitative Biology
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2408_08341
GroupedDBID AKY
ALC
GOX
ID FETCH-arxiv_primary_2408_083413
IEDL.DBID GOX
IngestDate Wed Aug 21 12:22:29 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2408_083413
OpenAccessLink https://arxiv.org/abs/2408.08341
ParticipantIDs arxiv_primary_2408_08341
PublicationCentury 2000
PublicationDate 2024-08-15
PublicationDateYYYYMMDD 2024-08-15
PublicationDate_xml – month: 08
  year: 2024
  text: 2024-08-15
  day: 15
PublicationDecade 2020
PublicationYear 2024
Score 3.8640625
SecondaryResourceType preprint
Snippet Generating peptides with desired properties is crucial for drug discovery and biotechnology. Traditional sequence-based and structure-based methods often...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Artificial Intelligence
Computer Science - Learning
Quantitative Biology - Quantitative Methods
Title Exploring Latent Space for Generating Peptide Analogs Using Protein Language Models
URI https://arxiv.org/abs/2408.08341
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdZ09b8IwEIZPhakLoqIVLbS9oWta4iQkjFXLx4AoUjqwRXbsVEhVQAQqfj53NrRdWP2lkz3ce77zY4CnIiAfIrTxijzmNKOSnuzngZfoxAzCwteR_T9lksazRfI-ZEwOnt7CyM1--eP4wKp6Yf7WM4kEfpleE4JLtsYfC5ectCiu4_i_caQxbdM_JzFqQuOo7vDVHccVXJiyBelvnRtOSdmVW0wpUDVIchEd9ZlLj3HO9SXaIGNCVl8V2mQ-zpmjsCxpprtXRP687Lu6hsfR8PNt4lkzsrVjRmRsYWYtDG6gTpG9aQOSliiE7DMlMQrzXqyUDHuhkr5Wg4B01C20z61yd76rA5eCPC9ffPpRF-rbzc7cQ63Suwe7fQf4pXMZ
link.rule.ids 228,230,782,887
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Exploring+Latent+Space+for+Generating+Peptide+Analogs+Using+Protein+Language+Models&rft.au=Liang%2C+Po-Yu&rft.au=Huang%2C+Xueting&rft.au=Duran%2C+Tibo&rft.au=Wiemer%2C+Andrew+J&rft.date=2024-08-15&rft_id=info:doi/10.48550%2Farxiv.2408.08341&rft.externalDocID=2408_08341