Search Results - "Vella, Flavio"
-
1
Scaling betweenness centrality using communication-efficient sparse matrix multiplication
Published in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (12-11-2017)“…Betweenness centrality (BC) is a crucial graph problem that measures the significance of a vertex by the number of shortest paths leading through it. We…”
Get full text
Conference Proceeding -
2
multi-GPU aggregation-based AMG preconditioner for iterative linear solvers
Published in IEEE transactions on parallel and distributed systems (01-08-2023)“…We present and release in open source format a sparse linear solver which efficiently exploits heterogeneous parallel computers. The solver can be easily…”
Get full text
Journal Article -
3
Scalable Energy Games Solvers on GPUs
Published in IEEE transactions on parallel and distributed systems (01-12-2021)“…Modeling the consumption of limited resources, e.g., time or energy, plays a central role on the design of reactive systems such as embedded controllers. To…”
Get full text
Journal Article -
4
Strategies and systems towards grids and clouds integration:A DBMS-based solution
Published in Future generation computer systems (01-11-2018)“…Cloud and Grid computing share some essential driving ideas although the computing and economic models are very different. In this paper, we propose different…”
Get full text
Journal Article -
5
Solutions to the st-connectivity problem using a GPU-based distributed BFS
Published in Journal of parallel and distributed computing (01-02-2015)“…The st-connectivity problem (ST-CON) is a decision problem that asks, for vertices and in a graph, if is reachable from . Although originally defined for…”
Get full text
Journal Article -
6
Multilevel Parallelism for the Exploration of Large-Scale Graphs
Published in IEEE transactions on multi-scale computing systems (01-07-2018)“…We present the most recent release of our parallel implementation of the BFS and BC algorithms for the study of large scale graphs. Although our reference…”
Get full text
Journal Article -
7
The AES Implantation Based on OpenCL for Multi/many Core Architecture
Published in 2010 International Conference on Computational Science and Its Applications (2010)“…In this article we present a study on an implementation, named clAES, of the symmetric key cryptography algorithm Advanced Encryption Standard (AES) using the…”
Get full text
Conference Proceeding -
8
Gauss-Newton Natural Gradient Descent for Physics-Informed Computational Fluid Dynamics
Published 16-02-2024“…We propose Gauss-Newton's method in function space for the solution of the Navier-Stokes equations in the physics-informed neural network (PINN) framework…”
Get full text
Journal Article -
9
State of practice: evaluating GPU performance of state vector and tensor network methods
Published 11-01-2024“…The frontier of quantum computing (QC) simulation on classical hardware is quickly reaching the hard scalability limits for computational feasibility…”
Get full text
Journal Article -
10
cuVegas: Accelerate Multidimensional Monte Carlo Integration through a Parallelized CUDA-based Implementation of the VEGAS Enhanced Algorithm
Published 17-08-2024“…This paper introduces cuVegas, a CUDA-based implementation of the Vegas Enhanced Algorithm (VEGAS+), optimized for multi-dimensional integration in GPU…”
Get full text
Journal Article -
11
On the Efficacy of Surface Codes in Compensating for Radiation Events in Superconducting Devices
Published 15-07-2024“…Reliability is fundamental for developing large-scale quantum computers. Since the benefit of technological advancements to the qubit's stability is…”
Get full text
Journal Article -
12
Multi-GPU aggregation-based AMG preconditioner for iterative linear solvers
Published 04-03-2023“…IEEE Transactions on Parallel and Distributed Systems (2023) We present and release in open source format a sparse linear solver which efficiently exploits…”
Get full text
Journal Article -
13
High Performance Unstructured SpMM Computation Using Tensor Cores
Published 21-08-2024“…High-performance sparse matrix-matrix (SpMM) multiplication is paramount for science and industry, as the ever-increasing sizes of data prohibit using dense…”
Get full text
Journal Article -
14
Scaling Expected Force: Efficient Identification of Key Nodes in Network-Based Epidemic Models
Published in 2024 32nd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) (20-03-2024)“…Structural centrality measures are often used to approximate or predict dynamical influence in a network. The recently proposed Expected Force of Infection…”
Get full text
Conference Proceeding -
15
The Landscape of GPU-Centric Communication
Published 15-09-2024“…In recent years, GPUs have become the preferred accelerators for HPC and ML applications due to their parallelism and fast memory bandwidth. While GPUs boost…”
Get full text
Journal Article -
16
Blocking Techniques for Sparse Matrix Multiplication on Tensor Accelerators
Published 11-02-2022“…Tensor accelerators have gained popularity because they provide a cheap and efficient solution for speeding up computational-expensive tasks in Deep Learning…”
Get full text
Journal Article -
17
The potential of high-performance computing for the Internet of Sounds
Published in 2023 4th International Symposium on the Internet of Sounds (26-10-2023)“…High-Performance Computing (HPC) technology is impacting several industries, including the creative industries and those operating in the Internet of Things…”
Get full text
Conference Proceeding -
18
Scaling Expected Force: Efficient Identification of Key Nodes in Network-based Epidemic Models
Published 01-06-2023“…Centrality measures are fundamental tools of network analysis as they highlight the key actors within the network. This study focuses on a newly proposed…”
Get full text
Journal Article -
19
Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching
Published in 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01-05-2022)“…Triangle count and local clustering coefficient are two core metrics for graph analysis. They find broad application in analyses such as community detection…”
Get full text
Conference Proceeding -
20
Blocking Sparse Matrices to Leverage Dense-Specific Multiplication
Published in 2022 IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms (IA3) (01-11-2022)“…Research to accelerate matrix multiplication, pushed by the growing computational demands of deep learning, has sprouted many efficient architectural…”
Get full text
Conference Proceeding