Comparative analysis of mass spectral matching-based compound identification in gas chromatography–mass spectrometry
•Compared performance of 5 literature-reported compound identification measures.•Estimated dependency of identification accuracy on the size of reference library.•Weight factors dependent on both reference library and spectral similarity measures.•The semi-partial correlation mixture measure perform...
Saved in:
Published in: | Journal of Chromatography A Vol. 1298; pp. 132 - 138 |
---|---|
Main Authors: | , , |
Format: | Journal Article |
Language: | English |
Published: |
Amsterdam
Elsevier B.V
12-07-2013
Elsevier |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •Compared performance of 5 literature-reported compound identification measures.•Estimated dependency of identification accuracy on the size of reference library.•Weight factors dependent on both reference library and spectral similarity measures.•The semi-partial correlation mixture measure performs the best.
Compound identification in gas chromatography–mass spectrometry (GC–MS) is usually achieved by matching query spectra to spectra present in a reference library. Although several spectral similarity measures have been developed and compared using a small reference library, it still remains unknown how the relationship between the spectral similarity measure and the size of reference library affects on the identification accuracy as well as the optimal weight factor. We used three reference libraries to investigate the dependency of the optimal weight factor, spectral similarity measure and the size of reference library. Our study demonstrated that the optimal weight factor depends on not only spectral similarity measure but also the size of reference library. The mixture semi-partial correlation measure outperforms all existing spectral similarity measures in all tested reference libraries, in spite of the computational expense. Furthermore, the accuracy of compound identification using a larger reference library in future is estimated by varying the size of reference library. Simulation study indicates that the mixture semi-partial correlation measure will have the best performance with the increase of reference library in future. |
---|---|
Bibliography: | http://dx.doi.org/10.1016/j.chroma.2013.05.021 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0021-9673 1873-3778 |
DOI: | 10.1016/j.chroma.2013.05.021 |