Integration of pathway structure information into a reweighted partial Cox regression approach for survival analysis on high-dimensional gene expression data

Accurately predicting the risk of cancer relapse or death is important for clinical utility. The emerging high-dimensional gene expression data provide the opportunity as well as the challenge to uncover the relationship between gene expression and censored survival outcome. While several Cox models...

Full description

Saved in:
Bibliographic Details
Published in:Molecular bioSystems Vol. 11; no. 7; p. 1876
Main Authors: Liu, Wei, Wang, Qiuyu, Zhao, Jianmei, Zhang, Chunlong, Liu, Yuejuan, Zhang, Jian, Bai, Xuefeng, Li, Xuecang, Feng, Houming, Liao, Mingzhi, Wang, Wei, Li, Chunquan
Format: Journal Article
Language:English
Published: England 01-07-2015
Subjects:
Online Access:Get more information
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurately predicting the risk of cancer relapse or death is important for clinical utility. The emerging high-dimensional gene expression data provide the opportunity as well as the challenge to uncover the relationship between gene expression and censored survival outcome. While several Cox models have been proposed to deal with high-dimensional covariates and censored continuous survival data, they usually generalize poorly to independent datasets. Most methods build the Cox model exclusively on gene expression data, but ignore the molecular interaction relation among genes, which has been successfully integrated into molecular classification with categorical outcomes and improved predictive performance. Here, we integrate gene-interaction information into a Cox model and propose a reweighted partial Cox regression (RPCR) approach in order to accurately predict the risk of cancer events. RPCR improves the predictive accuracy and generalization of a Cox model by promoting genes with large topological importance, which is evaluated by a directed random walk in a reconstructed global pathway graph. We applied RPCR to the survival prediction of two cancer types and used two concordance statistic measures to assess the prediction performance. Both within-dataset experiments and cross-dataset experiments showed that RPCR could predict the risk of patients with higher accuracy and greater robustness.
ISSN:1742-2051
DOI:10.1039/c5mb00044k