Search Results - "Lima, João V.F."
1
An evaluation of relational and NoSQL distributed databases on a low-power cluster
Published in The Journal of supercomputing (01-08-2023)“…The constant growth of social media, unconventional web technologies, mobile applications, and Internet of Things (IoT) devices create challenges for cloud…”
Journal Article
2
Collaborative execution of fluid flow simulation using non-uniform decomposition on heterogeneous architectures
Published in Journal of parallel and distributed computing (01-06-2021)“…The demand for computing power, along with the diversity of computational problems, culminated in a variety of heterogeneous architectures. Among them, hybrid…”
Journal Article
3
NAS Parallel Benchmarks with Python: a performance and programming effort analysis focusing on GPUs
Published in The Journal of supercomputing (01-05-2023)“…Compiled low-level languages, such as C/C++ and Fortran, have been employed as programming tools to implement applications to explore GPU devices. As a…”
Journal Article
4
Design and analysis of scheduling strategies for multi-CPU and multi-GPU architectures
Published in Parallel computing (01-05-2015)“…•We evaluated four scheduling strategies for multi-CPU and multi-GPU architectures.•We designed a framework with performance models for task and transfer…”
Journal Article
5
Preliminary Experiments with XKaapi on Intel Xeon Phi Coprocessor
Published in 2013 25th International Symposium on Computer Architecture and High Performance Computing (01-10-2013)“…This paper presents preliminary performance comparisons of parallel applications developed natively for the Intel Xeon Phi accelerator using three different…”
Conference Proceeding
6
Performance and Energy Analysis of OpenMP Runtime Systems with Dense Linear Algebra Algorithms
Published in 2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) (01-10-2017)“…In this paper, we analyse performance and energy consumption of four OpenMP runtime systems over a NUMA platform. We present an experimental study to…”
Conference Proceeding
7
Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs
Published in 2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing (01-10-2012)“…The race for Exascale computing has naturally led the current technologies to converge to multi-CPU/multi-GPU computers, based on thousands of CPUs and GPUs…”
Conference Proceeding
8
An evaluation of Cassandra NoSQL database on a low-power cluster
Published in 2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) (01-10-2021)“…The constant growth of social media, unconventional web technologies, mobile applications, and Internet of Things (IoT) devices, create challenges for cloud…”
Conference Proceeding
9
XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures
Published in 2013 IEEE 27th International Symposium on Parallel and Distributed Processing (01-05-2013)“…Most recent HPC platforms have heterogeneous nodes composed of multi-core CPUs and accelerators, like GPUs. Programming such nodes is typically based on a…”
Conference Proceeding
10
A Memory Affinity Analysis of Scientific Applications on NUMA Platforms
Published in 2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) (01-10-2021)“…Understanding the underlying architecture is essential for scientific applications in general. An example of a computing environment is Non-Uniform Memory…”
Conference Proceeding
11
A Dynamic Task-Based D3Q19 Lattice-Boltzmann Method for Heterogeneous Architectures
Published in 2019 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) (01-02-2019)“…Nowadays computing platforms expose a significant number of heterogeneous processing units such as multicore processors and accelerators. The task-based…”
Conference Proceeding
12
Evaluation of two topology-aware heuristics on level-3 BLAS library for multi-GPU platforms
Published in 2021 SC Workshops Supplementary Proceedings (SCWS) (01-11-2021)“…Nowadays GPUs have dominated the market considering the computing/power metric and numerous research works have provided Basic Linear Algebra Subprograms…”
Conference Proceeding
13
HPSM: A Programming Framework for Multi-CPU and Multi-GPU Systems
Published in 2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) (01-10-2017)“…This paper presents a high-level C++ framework to explore multi-CPU and multi-GPU systems called HPSM. HPSM enables parallel loops and reductions implemented…”
Conference Proceeding
14
XKBlas: a High Performance Implementation of BLAS-3 Kernels on Multi-GPU Server
Published in 2020 28th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) (01-03-2020)“…In the last ten years, GPUs have dominated the market considering the computing/power metric and numerous research works have provided Basic Linear Algebra…”
Conference Proceeding