CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets

In the past decades,advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation.Recently,nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to i...

Full description

Saved in:
Bibliographic Details
Published in:Genomics, proteomics & bioinformatics Vol. 12; no. 1; pp. 48 - 51
Main Authors: Liao, Ruiqi, Zhang, Yifan, Guan, Jihong, Zhou, Shuigeng
Format: Journal Article
Language:English
Published: China Elsevier Ltd 01-02-2014
School of Computer Science, Fudan University, Shanghai 200433, China%Department of Computer Science and Technology, Tongji University, Shanghai 200092, China%School of Computer Science, Fudan University, Shanghai 200433, China
Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai 200433, China
Elsevier
Oxford University Press
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In the past decades,advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation.Recently,nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to interpret them,and has been applied to various fields of biological research.In this paper,we present CloudNMF,a distributed open-source implementation of NMF on a MapReduce framework.Experimental evaluation demonstrated that CloudNMF is scalable and can be used to deal with huge amounts of data,which may enable various kinds of a high-throughput biological data analysis in the cloud.CloudNMF is freely accessible at http://admis.fudan.edu.cn/projects/CloudNMF.html.
Bibliography:In the past decades,advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation.Recently,nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to interpret them,and has been applied to various fields of biological research.In this paper,we present CloudNMF,a distributed open-source implementation of NMF on a MapReduce framework.Experimental evaluation demonstrated that CloudNMF is scalable and can be used to deal with huge amounts of data,which may enable various kinds of a high-throughput biological data analysis in the cloud.CloudNMF is freely accessible at http://admis.fudan.edu.cn/projects/CloudNMF.html.
Nonnegative matrix factorization;MapReduce;Bioinformatics
11-4926/Q
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:1672-0229
2210-3244
DOI:10.1016/j.gpb.2013.06.001