Mislocalization-related disease gene discovery using gene expression based computational protein localization prediction
•Predicting cancer disease genes related to protein mislocalization.•Complementary to genome sequencing for identifying cancer genes.•Generating hypothesis on potential mechanisms underlying cancer. Protein sorting is an important mechanism for transporting proteins to their target subcellular locat...
Saved in:
Published in: | Methods (San Diego, Calif.) Vol. 93; pp. 119 - 127 |
---|---|
Main Authors: | , |
Format: | Journal Article |
Language: | English |
Published: |
United States
Elsevier Inc
15-01-2016
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •Predicting cancer disease genes related to protein mislocalization.•Complementary to genome sequencing for identifying cancer genes.•Generating hypothesis on potential mechanisms underlying cancer.
Protein sorting is an important mechanism for transporting proteins to their target subcellular locations after their synthesis. Mutations on genes may disrupt the well regulated protein sorting process, leading to a variety of mislocation related diseases. This paper proposes a methodology to discover such disease genes based on gene expression data and computational protein localization prediction. A kernel logistic regression based algorithm is used to successfully identify several candidate cancer genes which may cause cancers due to their mislocation within the cell. Our results also showed that compared to the gene co-expression network defined on Pearson correlation coefficients, the nonlinear Maximum Correlation Coefficients (MIC) based co-expression network give better results for subcellular localization prediction. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1046-2023 1095-9130 |
DOI: | 10.1016/j.ymeth.2015.09.022 |