Applying data science methodologies with artificial intelligence variant reinterpretation to map and estimate genetic disorder prevalence utilizing clinical data
Data science methodologies can be utilized to ascertain and analyze clinical genetic data that is often unstructured and rarely used outside of patient encounters. Genetic variants from all genetic testing resulting to a large pediatric healthcare system for a 5‐year period were obtained and reinter...
Saved in:
Published in: | American journal of medical genetics. Part A Vol. 194; no. 5; pp. e63505 - n/a |
---|---|
Main Authors: | , , , , , , , , , , , , , , , , , , , |
Format: | Journal Article |
Language: | English |
Published: |
Hoboken, USA
John Wiley & Sons, Inc
01-05-2024
Wiley Subscription Services, Inc |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Data science methodologies can be utilized to ascertain and analyze clinical genetic data that is often unstructured and rarely used outside of patient encounters. Genetic variants from all genetic testing resulting to a large pediatric healthcare system for a 5‐year period were obtained and reinterpreted utilizing the previously validated Franklin© Artificial Intelligence (AI). Using PowerBI©, the data were further matched to patients in the electronic healthcare record to associate with demographic data to generate a variant data table and mapped by ZIP codes. Three thousand and sixty‐five variants were identified and 98% were matched to patients with geographic data. Franklin© changed the interpretation for 24% of variants. One hundred and fifty‐six clinically actionable variant reinterpretations were made. A total of 739 Mendelian genetic disorders were identified with disorder prevalence estimation. Mapping of variants demonstrated hot‐spots for pathogenic genetic variation such as PEX6‐associated Zellweger Spectrum Disorder. Seven patients were identified with Bardet‐Biedl syndrome and seven patients with Rett syndrome amenable to newly FDA‐approved therapeutics. Utilizing readily available software we developed a database and Exploratory Data Analysis (EDA) methodology enabling us to systematically reinterpret variants, estimate variant prevalence, identify conditions amenable to new treatments, and localize geographies enriched for pathogenic variants. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1552-4825 1552-4833 1552-4833 |
DOI: | 10.1002/ajmg.a.63505 |