Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation

Design and interpretation of genome sequencing assays in clinical diagnostics and research labs is complicated by an inability to identify information from the medical literature and related databases quickly, comprehensively and reproducibly. This challenge is compounded by the complexity and heter...

Full description

Saved in:
Bibliographic Details
Published in:Frontiers in genetics Vol. 11; p. 577152
Main Authors: Chunn, Lauren M, Nefcy, Diane C, Scouten, Rachel W, Tarpey, Ryan P, Chauhan, Gurinder, Lim, Megan S, Elenitoba-Johnson, Kojo S J, Schwartz, Steven A, Kiel, Mark J
Format: Journal Article
Language:English
Published: Switzerland Frontiers Media S.A 13-11-2020
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Design and interpretation of genome sequencing assays in clinical diagnostics and research labs is complicated by an inability to identify information from the medical literature and related databases quickly, comprehensively and reproducibly. This challenge is compounded by the complexity and heterogeneity of nomenclatures used to describe diseases, genes and genetic variants. Mastermind is a widely-used bioinformatic platform of genomic associations that has indexed more than 7.5 M full-text articles and 2.5 M supplemental datasets. It has automatically identified, disambiguated and annotated >6.1 M genetic variants and identified >50 K disease-gene associations. Here, we describe how Mastermind improves the sensitivity and reproducibility of clinical variant interpretation and produces comprehensive genomic landscapes of genetic variants driving pharmaceutical research. We demonstrate an alarmingly high degree of heterogeneity across commercially available panels for hereditary cancer that is resolved by evidence from Mastermind. We further examined the sensitivity of Mastermind for variant interpretation by examining 108 clinically-encountered variants and comparing the results to alternate methods. Mastermind demonstrated a sensitivity of 98.4% compared to 4.4, 45.6, and 37.4% for alternatives PubMed, Google Scholar, and ClinVar, respectively, and a specificity of 98.5% compared to 45.1, 57.6, and 68.8% as well as an increase in content yield of 22.6-, 2.2-, and 2.6-fold. When curated for clinical significance, Mastermind identified more than 4.9-fold more pathogenic variants than ClinVar for representative genes. For structural variants, we compared Mastermind's ability to sensitively identify evidence for 10 representative disease-causing CNVs versus results identified in PubMed, as well as its ability to identify evidence for fusion events compared to COSMIC. Mastermind demonstrated a 4.0- to 43.9-fold increase in references for specific CNVs compared to PubMed, as well as 5.4-fold more fusion genes when compared with COSMIC's curated database. Additionally, Mastermind produced an 8.0-fold increase in reference citations for fusion events common to Mastermind and outside databases. Taken together, these results demonstrate the utility and superiority of Mastermind in terms of both sensitivity and specificity of automated results for clinical diagnostic variant interpretation for multiple genetic variant types and highlight the potential benefit in informing pharmaceutical research.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics
Reviewed by: Gavin R. Oliver, Mayo Clinic, United States; Abhishek Kumar, University of Kiel, Germany
Edited by: Nicola Mulder, University of Cape Town, South Africa
ISSN:1664-8021
1664-8021
DOI:10.3389/fgene.2020.577152