KEC: unique sequence search by K-mer exclusion
Published in: | Bioinformatics (Oxford, England) Vol. 37; no. 19; pp. 3349 - 3350 |
---|---|
Main Authors: | Beran, Pavel; Stehlíková, Dagmar; Cohen, Stephen P; Čurn, Vladislav |
Format: | Journal Article |
Language: | English |
Published: | England: Oxford University Press, 11 October 2021 |
Abstract | Abstract
Summary
Searching for amino acid or nucleic acid sequences unique to one organism can be challenging, depending on the size of the available datasets. K-mer elimination by cross-reference (KEC) allows users to find unique sequences quickly and easily by providing target and non-target sequences. Because it is fast, KEC can be applied to datasets of genomic size and can be run on desktop or laptop computers with modest specifications.
Availability and implementation
KEC is freely available for non-commercial purposes. Source code and executable binary files compiled for Linux, Mac and Windows can be downloaded from https://github.com/berybox/KEC.
Supplementary information
Supplementary data are available at Bioinformatics online. |
---|---|
Author | Beran, Pavel; Stehlíková, Dagmar; Cohen, Stephen P; Čurn, Vladislav |
Author_xml | – sequence: 1 givenname: Pavel orcidid: 0000-0002-2680-3958 surname: Beran fullname: Beran, Pavel email: beranp02@jcu.cz – sequence: 2 givenname: Dagmar surname: Stehlíková fullname: Stehlíková, Dagmar – sequence: 3 givenname: Stephen P surname: Cohen fullname: Cohen, Stephen P – sequence: 4 givenname: Vladislav surname: Čurn fullname: Čurn, Vladislav |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/33755102 (record in MEDLINE/PubMed) |
CitedBy_id | crossref_primary_10_1094_PDIS_05_22_1098_RE crossref_primary_10_1094_PDIS_10_23_2101_SR |
Cites_doi | 10.1038/s41598-018-32295-4 10.1094/PDIS-10-18-1819-RE |
ContentType | Journal Article |
Copyright | The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. |
DBID | NPM AAYXX CITATION 7X8 |
DOI | 10.1093/bioinformatics/btab196 |
DatabaseName | PubMed CrossRef MEDLINE - Academic |
DatabaseTitle | PubMed CrossRef MEDLINE - Academic |
DatabaseTitleList | PubMed MEDLINE - Academic |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 1367-4811 |
Editor | Robinson, Peter |
EndPage | 3350 |
ExternalDocumentID | 10_1093_bioinformatics_btab196 33755102 10.1093/bioinformatics/btab196 |
Genre | Journal Article |
GrantInformation_xml | – fundername: AFRI Education and Workforce Development Postdoctoral Fellowship grantid: 2018-08122 – fundername: Ministry of Education, Youths, and Sports grantid: MSMT-15739/2019-8 – fundername: European Cooperation in Science and Technology grantid: CA16107 – fundername: U.S. Department of Agriculture, National Institute of Food and Agriculture |
ISSN | 1367-4803 |
IngestDate | Fri Oct 25 05:55:14 EDT 2024 Thu Nov 21 23:20:47 EST 2024 Wed Oct 16 00:39:41 EDT 2024 Wed Aug 28 03:15:45 EDT 2024 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 19 |
Language | English |
License | This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model) The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. |
LinkModel | OpenURL |
ORCID | 0000-0002-2680-3958 |
OpenAccessLink | https://academic.oup.com/bioinformatics/article-pdf/37/19/3349/40556585/btab196.pdf |
PMID | 33755102 |
PQID | 2504341702 |
PQPubID | 23479 |
PageCount | 2 |
ParticipantIDs | proquest_miscellaneous_2504341702 crossref_primary_10_1093_bioinformatics_btab196 pubmed_primary_33755102 oup_primary_10_1093_bioinformatics_btab196 |
PublicationCentury | 2000 |
PublicationDate | 2021-Oct-11 |
PublicationDateYYYYMMDD | 2021-10-11 |
PublicationDate_xml | – month: 10 year: 2021 text: 2021-Oct-11 day: 11 |
PublicationDecade | 2020 |
PublicationPlace | England |
PublicationPlace_xml | – name: England |
PublicationTitle | Bioinformatics (Oxford, England) |
PublicationTitleAlternate | Bioinformatics |
PublicationYear | 2021 |
Publisher | Oxford University Press |
Publisher_xml | – name: Oxford University Press |
SSID | ssj0005056 |
SourceID | proquest crossref pubmed oup |
SourceType | Aggregation Database Index Database Publisher |
StartPage | 3349 |
Title | KEC: unique sequence search by K-mer exclusion |
URI | https://www.ncbi.nlm.nih.gov/pubmed/33755102 https://search.proquest.com/docview/2504341702 |
Volume | 37 |
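
The abstract describes finding sequences unique to a target by cross-referencing its k-mers against non-target sequences. The record does not give KEC's actual algorithm or options, so the following is only a minimal sketch of the general k-mer exclusion idea, not KEC's implementation. The function names `kmers` and `unique_regions` and the parameter `k` are hypothetical, chosen for illustration.

```python
def kmers(seq, k):
    """Yield (start, k-mer) for every window of length k in seq."""
    for i in range(len(seq) - k + 1):
        yield i, seq[i:i + k]


def unique_regions(target, non_targets, k):
    """Return maximal stretches of `target` whose k-mers all are absent
    from every sequence in `non_targets` (illustrative sketch only)."""
    # Collect every k-mer that occurs in any non-target sequence.
    excluded = {kmer for seq in non_targets for _, kmer in kmers(seq, k)}

    regions = []
    start = end = None
    for i, kmer in kmers(target, k):
        if kmer in excluded:
            # A shared k-mer closes the current unique stretch, if any.
            if start is not None:
                regions.append(target[start:end])
                start = None
            continue
        # Consecutive unshared k-mers overlap, so extend the stretch.
        if start is None:
            start = i
        end = i + k
    if start is not None:
        regions.append(target[start:end])
    return regions


if __name__ == "__main__":
    print(unique_regions("AAAACCCCGGGG", ["AAAACC", "CCGGGG"], 4))
```

This sketch compares k-mers as exact, case-sensitive strings and ignores reverse complements, both of which a real nucleic acid tool would likely need to handle. Holding all non-target k-mers in an in-memory set is also a simplification that may not scale to large genomes.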