Testing multiple gene interactions by the ordered combinatorial partitioning method in case–control studies

Motivation: The multifactor-dimensionality reduction (MDR) method has been widely used in multi-locus interaction analysis. It reduces dimensionality by partitioning the multi-locus genotypes into a high-risk group and a low-risk group according to whether the genotype-specific risk ratio exceeds a...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics Vol. 26; no. 15; pp. 1871 - 1878
Main Authors: Hua, Xing, Zhang, Han, Zhang, Hong, Yang, Yaning, Kuk, Anthony Y.C.
Format: Journal Article
Language:English
Published: Oxford Oxford University Press 01-08-2010
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Motivation: The multifactor-dimensionality reduction (MDR) method has been widely used in multi-locus interaction analysis. It reduces dimensionality by partitioning the multi-locus genotypes into a high-risk group and a low-risk group according to whether the genotype-specific risk ratio exceeds a fixed threshold or not. Alternatively, one can maximize the χ2 value exhaustively over all possible ways of partitioning the multi-locus genotypes into two groups, and we aim to show that this is computationally feasible. Methods: We advocate finding the optimal MDR (OMDR) that would have resulted from an exhaustive search over all possible ways of partitioning the multi-locus genotypes into two groups. It is shown that this optimal MDR can be obtained efficiently using an ordered combinatorial partitioning (OCP) method, which differs from the existing MDR method in the use of a data-driven rather than fixed threshold. The generalized extreme value distribution (GEVD) theory is applied to find the optimal order of gene combination and assess statistical significance of interactions. Results: The computational complexity of OCP strategy is linear in the number of multi-locus genotypes in contrast with an exponential order for the naive exhaustive search strategy. Simulation studies show that OMDR can be more powerful than MDR with substantial power gain possible when the partitioning of OMDR is different from that of MDR. The analysis results of a breast cancer dataset show that the use of GEVD accelerates the determination of interaction order and reduces the time cost for P-value calculation by more than 10-fold. Availability: C++ program is available at http://home.ustc.edu.cn/∼zhanghan/ocp/ocp.html Contact: zhanghan@mail.ustc.edu.cn Supplementary Information:Supplementary data are available at Bioinformatics online.
Bibliography:ArticleID:btq290
To whom correspondence should be addressed.
ark:/67375/HXZ-CJJ96JGN-T
Associate Editor: Martin Bishop
istex:0A4D199DF1010E22F5AFEF94C630AA4B4962E09D
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1367-4803
1460-2059
1367-4811
DOI:10.1093/bioinformatics/btq290