An algorithm for random match probability calculation from peptide sequences

•Random match probability from peptide sequences.•Accommodates LD.•Robust to drop-in events.•Open source. For the past three decades, forensic genetic investigations have focused on elucidating DNA signatures. While DNA has a number of desirable properties (e.g., presence in most biological material...

Full description

Saved in:

Bibliographic Details
Published in:	Forensic science international : genetics Vol. 47; p. 102295
Main Authors:	Woerner, August E., Hewitt, F. Curtis, Gardner, Myles W., Freitas, Michael A., Schulte, Kathleen Q., LeSassier, Danielle S., Baniasad, Maryam, Reed, Andrew J., Powals, Megan E., Smith, Alan R., Albright, Nicolette C., Ludolph, Benjamin C., Zhang, Liwen, Allen, Leah W., Weber, Katharina, Budowle, Bruce
Format:	Journal Article
Language:	English
Published:	Netherlands Elsevier B.V 01-07-2020
Subjects:	Exome sequencing Genetically variable peptides Liquid chromatography–tandem mass spectrometry Proteomics Random match probability Random match probability Genetically variable peptides Liquid chromatography–tandem mass spectrometry Exome sequencing Proteomics
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	•Random match probability from peptide sequences.•Accommodates LD.•Robust to drop-in events.•Open source. For the past three decades, forensic genetic investigations have focused on elucidating DNA signatures. While DNA has a number of desirable properties (e.g., presence in most biological materials, an amenable chemistry for analysis and well-developed statistics), DNA also has limitations. DNA may be in low quantity in some tissues, such as hair, and in some tissues it may degrade more readily than its protein counterparts. Recent research efforts have shown the feasibility of performing protein-based human identification in cases in which recovery of DNA is challenged; however, the methods involved in assessing the rarity of a given protein profile have not been addressed adequately. In this paper an algorithm is proposed that describes the computation of a random match probability (RMP) resulting from a genetically variable peptide signature. The approach described herein explicitly models proteomic error and genetic linkage, makes no assumptions as to allelic drop-out, and maps the observed proteomic alleles to their expected protein products from DNA which, in turn, permits standard corrections for population structure and finite database sizes. To assess the feasibility of this approach, RMPs were estimated from peptide profiles of skin samples from 25 individuals of European ancestry. 126 common peptide alleles were used in this approach, yielding a mean RMP of approximately 10−2.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1872-4973 1878-0326
DOI:	10.1016/j.fsigen.2020.102295