KEC: unique sequence search by K-mer exclusion

Bibliographic Details
Published in: Bioinformatics (Oxford, England), Vol. 37, no. 19, pp. 3349-3350
Main Authors: Beran, Pavel; Stehlíková, Dagmar; Cohen, Stephen P; Čurn, Vladislav
Format: Journal Article
Language: English
Published: England: Oxford University Press, 11 October 2021
Abstract Summary: Searching for amino acid or nucleic acid sequences unique to one organism may be challenging, depending on the size of the available datasets. K-mer elimination by cross-reference (KEC) allows users to quickly and easily find unique sequences by providing target and non-target sequences. Due to its speed, it can be used for datasets of genomic size and can be run on desktop or laptop computers with modest specifications. Availability and implementation: KEC is freely available for non-commercial purposes. Source code and executable binary files compiled for Linux, Mac and Windows can be downloaded from https://github.com/berybox/KEC. Supplementary information: Supplementary data are available at Bioinformatics online.
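Note (illustrative, not taken from the KEC source): the abstract describes the approach only at a high level, so the following Python sketch shows one plausible reading of k-mer exclusion. It collects every k-mer found in the non-target sequences, discards the matching k-mers from the target, and merges the surviving overlapping k-mers into contiguous candidate regions. The function names `kmers` and `unique_regions`, the parameter `k`, and the toy sequences are assumptions made for this example; KEC itself may differ, for instance in how it handles reverse complements or a minimum region length.

```python
def kmers(seq, k):
    """Yield (position, k-mer) for every window of length k in seq."""
    for i in range(len(seq) - k + 1):
        yield i, seq[i:i + k]


def unique_regions(target, non_targets, k):
    """Return stretches of `target` built only from k-mers absent in all non-targets.

    Hypothetical helper for illustration; not the KEC implementation.
    """
    # Step 1: pool every k-mer that occurs in any non-target sequence.
    excluded = set()
    for seq in non_targets:
        excluded.update(kmer for _, kmer in kmers(seq, k))

    # Step 2: walk the target, skip excluded k-mers, and merge runs of
    # surviving k-mers at consecutive positions into one region.
    regions = []
    start = None  # start position of the current run
    prev = None   # position of the last surviving k-mer in the run
    for i, kmer in kmers(target, k):
        if kmer in excluded:
            continue
        if prev is not None and i == prev + 1:
            prev = i  # adjacent to the previous survivor: extend the run
            continue
        if start is not None:
            regions.append(target[start:prev + k])  # close the previous run
        start = prev = i
    if start is not None:
        regions.append(target[start:prev + k])
    return regions


# Toy example: only the junction between the two non-target pieces is unique.
print(unique_regions("AAACCCGGGTTT", ["AAACCC", "GGGTTT"], k=4))  # ['CCCGGG']
```

The set lookup keeps the exclusion step roughly linear in the total input length, which is in keeping with the abstract's emphasis on speed, although memory use for genome-sized inputs would need a more compact k-mer representation than Python strings.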
Author Beran, Pavel
Čurn, Vladislav
Stehlíková, Dagmar
Cohen, Stephen P
Author_xml – sequence: 1
  givenname: Pavel
  orcidid: 0000-0002-2680-3958
  surname: Beran
  fullname: Beran, Pavel
  email: beranp02@jcu.cz
– sequence: 2
  givenname: Dagmar
  surname: Stehlíková
  fullname: Stehlíková, Dagmar
– sequence: 3
  givenname: Stephen P
  surname: Cohen
  fullname: Cohen, Stephen P
– sequence: 4
  givenname: Vladislav
  surname: Čurn
  fullname: Čurn, Vladislav
BackLink https://www.ncbi.nlm.nih.gov/pubmed/33755102 (View this record in MEDLINE/PubMed)
CitedBy_id crossref_primary_10_1094_PDIS_05_22_1098_RE
crossref_primary_10_1094_PDIS_10_23_2101_SR
Cites_doi 10.1038/s41598-018-32295-4
10.1094/PDIS-10-18-1819-RE
ContentType Journal Article
Copyright The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
DBID NPM
AAYXX
CITATION
7X8
DOI 10.1093/bioinformatics/btab196
DatabaseName PubMed
CrossRef
MEDLINE - Academic
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1367-4811
Editor Robinson, Peter
Editor_xml – sequence: 1
  givenname: Peter
  surname: Robinson
  fullname: Robinson, Peter
EndPage 3350
ExternalDocumentID 10_1093_bioinformatics_btab196
33755102
10.1093/bioinformatics/btab196
Genre Journal Article
GrantInformation_xml – fundername: AFRI Education and Workforce Development Postdoctoral Fellowship
  grantid: 2018-08122
– fundername: Ministry of Education, Youths, and Sports
  grantid: MSMT-15739/2019-8
– fundername: European Cooperation in Science and Technology
  grantid: CA16107
– fundername: U.S. Department of Agriculture, National Institute of Food and Agriculture
ISSN 1367-4803
IngestDate Fri Oct 25 05:55:14 EDT 2024
Thu Nov 21 23:20:47 EST 2024
Wed Oct 16 00:39:41 EDT 2024
Wed Aug 28 03:15:45 EDT 2024
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 19
Language English
License This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
LinkModel OpenURL
ORCID 0000-0002-2680-3958
OpenAccessLink https://academic.oup.com/bioinformatics/article-pdf/37/19/3349/40556585/btab196.pdf
PMID 33755102
PQID 2504341702
PQPubID 23479
PageCount 2
ParticipantIDs proquest_miscellaneous_2504341702
crossref_primary_10_1093_bioinformatics_btab196
pubmed_primary_33755102
oup_primary_10_1093_bioinformatics_btab196
PublicationCentury 2000
PublicationDate 2021-Oct-11
PublicationDateYYYYMMDD 2021-10-11
PublicationDate_xml – month: 10
  year: 2021
  text: 2021-Oct-11
  day: 11
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
PublicationTitle Bioinformatics (Oxford, England)
PublicationTitleAlternate Bioinformatics
PublicationYear 2021
Publisher Oxford University Press
Publisher_xml – name: Oxford University Press
References Karim (2023051608261159600_btab196-B1) 2019; 103
Panyukov (2023051608261159600_btab196-B3) 2017; 12
Larrea-Sarmiento (2023051608261159600_btab196-B2) 2018; 8
References_xml – volume: 8
  start-page: 14298
  year: 2018
  ident: 2023051608261159600_btab196-B2
  article-title: Development of a genome-informed loop-mediated isothermal amplification assay for rapid and specific detection of Xanthomonas euvesicatoria
  publication-title: Sci. Rep
  doi: 10.1038/s41598-018-32295-4
  contributor:
    fullname: Larrea-Sarmiento
– volume: 103
  start-page: 2893
  year: 2019
  ident: 2023051608261159600_btab196-B1
  article-title: Development of the automated primer design workflow uniqprimer and diagnostic primers for the broad-host-range plant pathogen Dickeya dianthicola
  publication-title: Plant Dis
  doi: 10.1094/PDIS-10-18-1819-RE
  contributor:
    fullname: Karim
– volume: 12
  start-page: 547
  year: 2017
  ident: 2023051608261159600_btab196-B3
  article-title: Short unique sequences in bacterial genomes as strain- and species-specific signatures
  publication-title: Math. Biol. Bioinf
  contributor:
    fullname: Panyukov
SSID ssj0005056
SourceID proquest
crossref
pubmed
oup
SourceType Aggregation Database
Index Database
Publisher
StartPage 3349
Title KEC: unique sequence search by K-mer exclusion
URI https://www.ncbi.nlm.nih.gov/pubmed/33755102
https://search.proquest.com/docview/2504341702
Volume 37