Search Results - "Touriño, Juan"
1
SparkEC: speeding up alignment-based DNA error correction tools
Published in BMC bioinformatics (07-11-2022)“…Background: In recent years, huge improvements have been made in the context of sequencing genomic data under what is called Next Generation Sequencing…”
Journal Article -
2
PATO: genome-wide prediction of lncRNA–DNA triple helices
Published in Bioinformatics (Oxford, England) (01-03-2023)“…Motivation: Long non-coding RNA (lncRNA) plays a key role in many biological processes. For instance, lncRNA regulates chromatin using different…”
Journal Article -
3
SeQual-Stream: approaching stream processing to quality control of NGS datasets
Published in BMC bioinformatics (27-10-2023)“…Quality control of DNA sequences is an important data preprocessing step in many genomic analyses. However, all existing parallel tools for this purpose are…”
Journal Article -
4
Real-time resource scaling platform for Big Data workloads on serverless environments
Published in Future generation computer systems (01-04-2020)“…The serverless execution paradigm is becoming an increasingly popular option when workloads are to be deployed in an abstracted way, more specifically, without…”
Journal Article -
5
pRIblast: A highly efficient parallel application for comprehensive lncRNA–RNA interaction prediction
Published in Future generation computer systems (01-01-2023)“…Long non-coding RNAs (lncRNAs) play a key role in several biological processes and scientists are constantly trying to come up with new strategies to elucidate…”
Journal Article -
6
Parallel-FST: A feature selection library for multicore clusters
Published in Journal of parallel and distributed computing (01-11-2022)“…Feature selection is a subfield of machine learning focused on reducing the dimensionality of datasets by performing a computationally intensive process. This…”
Journal Article -
7
BDEv 3.0: Energy efficiency and microarchitectural characterization of Big Data processing frameworks
Published in Future generation computer systems (01-09-2018)“…As the size of Big Data workloads keeps increasing, the evaluation of distributed frameworks becomes a crucial task in order to identify potential performance…”
Journal Article -
8
MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems
Published in Bioinformatics (Oxford, England) (15-12-2016)“…MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of…”
Journal Article -
9
Parallel feature selection for distributed-memory clusters
Published in Information sciences (01-09-2019)“…• Feature selection is an important data mining stage in the field of machine learning. • Fast-mRMR-MPI, a tool to accelerate feature selection on clusters, is…”
Journal Article -
10
ParRADMeth: Identification of Differentially Methylated Regions on Multicore Clusters
Published in IEEE/ACM transactions on computational biology and bioinformatics (01-05-2023)“…The discovery of Differentially Methylated (DM) regions is an important research field in biology, as it can help to anticipate the risk of suffering from…”
Journal Article -
11
SMusket: Spark-based DNA error correction on distributed-memory systems
Published in Future generation computer systems (01-10-2020)“…Next-Generation Sequencing (NGS) technologies have revolutionized genomics research over the last decade, bringing new opportunities for scientists to perform…”
Journal Article -
12
Serverless-like platform for container-based YARN clusters
Published in Future generation computer systems (01-06-2024)“…Serverless computing is an emerging paradigm that has gained a lot of relevance in recent years, as it allows users to consume computing resources without…”
Journal Article -
13
HSRA: Hadoop-based spliced read aligner for RNA sequencing data
Published in PloS one (31-07-2018)“…Nowadays, the analysis of transcriptome sequencing (RNA-seq) data has become the standard method for quantifying the levels of gene expression. In RNA-seq…”
Journal Article -
14
A pipeline architecture for feature-based unsupervised clustering using multivariate time series from HPC jobs
Published in Information fusion (01-05-2023)“…Time series are key across industrial and research areas for their ability to model behaviour across time, making them ideal for a wide range of use cases such…”
Journal Article -
15
CUDA acceleration of MI-based feature selection methods
Published in Journal of parallel and distributed computing (01-08-2024)“…Feature selection algorithms are necessary nowadays for machine learning as they are capable of removing irrelevant and redundant information to reduce the…”
Journal Article -
16
Performance analysis of HPC applications in the cloud
Published in Future generation computer systems (01-01-2013)“…The scalability of High Performance Computing (HPC) applications depends heavily on the efficient support of network communications in virtualized…”
Journal Article -
17
BDWatchdog: Real-time monitoring and profiling of Big Data applications and frameworks
Published in Future generation computer systems (01-10-2018)“…Current Big Data applications are characterized by a heavy use of system resources (e.g., CPU, disk) generally distributed across a cluster. To effectively…”
Journal Article -
18
Parallel Pairwise Epistasis Detection on Heterogeneous Computing Architectures
Published in IEEE transactions on parallel and distributed systems (01-08-2016)“…Development of new methods to detect pairwise epistasis, such as SNP-SNP interactions, in Genome-Wide Association Studies is an important task in…”
Journal Article -
19
Flame-MR: An event-driven architecture for MapReduce applications
Published in Future generation computer systems (01-12-2016)“…Nowadays, many organizations analyze their data with the MapReduce paradigm, most of them using the popular Apache Hadoop framework. As the data size managed…”
Journal Article -
20
Enhancing in-memory efficiency for MapReduce-based data processing
Published in Journal of parallel and distributed computing (01-10-2018)“…As the memory capacity of computational systems increases, the in-memory data management of Big Data processing frameworks becomes more crucial for…”
Journal Article