Search Results - "Jeannot, Emmanuel"
-
1
An introspection monitoring library to improve MPI communication time
Published in The Journal of supercomputing (01-07-2023)“…In this paper, we describe how to improve communication time of MPI parallel applications with the use of a library that enables to monitor MPI applications…”
Get full text
Journal Article -
2
IO-aware Job-Scheduling: Exploiting the Impacts of Workload Characterizations to select the Mapping Strategy
Published in The international journal of high performance computing applications (01-07-2023)“…In high performance, computing concurrent applications are sharing the same file system. However, the bandwidth which provides access to the storage is…”
Get full text
Journal Article -
3
Scheduling periodic I/O access with bi-colored chains: models and algorithms
Published in Journal of scheduling (01-10-2021)“…Observations show that some HPC applications periodically alternate between (i) operations (computations, local data accesses) executed on the compute nodes,…”
Get full text
Journal Article -
4
Evaluation and Optimization of the Robustness of DAG Schedules in Heterogeneous Environments
Published in IEEE transactions on parallel and distributed systems (01-04-2010)“…A schedule is said to be robust if it is able to absorb some degree of uncertainty in task or communication durations while maintaining a stable solution. This…”
Get full text
Journal Article -
5
Study on progress threads placement and dedicated cores for overlapping MPI nonblocking collectives on manycore processor
Published in The international journal of high performance computing applications (01-11-2019)“…To amortize the cost of MPI collective operations, nonblocking collectives have been proposed so as to allow communications to be overlapped with computation…”
Get full text
Journal Article -
6
TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers
Published in 2017 IEEE International Conference on Cluster Computing (CLUSTER) (01-09-2017)“…Reading and writing data efficiently from storage system is necessary for most scientific simulations to achieve good performance at scale. Many software…”
Get full text
Conference Proceeding -
7
Topology-aware job mapping
Published in The international journal of high performance computing applications (01-01-2018)“…A Resource and Job Management System (RJMS) is a crucial system software part of the HPC stack. It is responsible for efficiently delivering computing power to…”
Get full text
Journal Article -
8
Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed
Published in The international journal of high performance computing applications (2006)“…Large scale distributed systems such as Grids are difficult to study from theoretical models and simulators only. Most Grids deployed at large scale are…”
Get full text
Journal Article -
9
Process mapping on any topology with TopoMatch
Published in Journal of parallel and distributed computing (01-12-2022)“…•We present TopoMatch.•TopoMatch is a tool and algorithm to perform process and thread mapping.•We Show that TopoMatch can use any type of topologies.•We show…”
Get full text
Journal Article -
10
Process Placement in Multicore Clusters:Algorithmic Issues and Practical Techniques
Published in IEEE transactions on parallel and distributed systems (01-04-2014)“…Current generations of NUMA node clusters feature multicore or manycore processors. Programming such architectures efficiently is a challenge because numerous…”
Get full text
Journal Article -
11
Adding topology and memory awareness in data aggregation algorithms
Published in Future generation computer systems (01-10-2024)“…With the growing gap between computing power and the ability of large-scale systems to ingest data, I/O is becoming the bottleneck for many scientific…”
Get full text
Journal Article -
12
Symbolic mapping and allocation for the Cholesky factorization on NUMA machines: Results and optimizations
Published in The International journal of high performance computing applications (01-08-2013)“…We discuss some performance issues of the tiled Cholesky factorization on non-uniform memory access-time (NUMA) shared memory machines. We show how to optimize…”
Get full text
Journal Article Conference Proceeding -
13
Tracing task‐based runtime systems: Feedbacks from the StarPU case
Published in Concurrency and computation (01-02-2024)“…Summary Given the complexity of current supercomputers and applications, being able to trace application executions to understand their behavior is not a…”
Get full text
Journal Article -
14
Modeling Non-Uniform Memory Access on Large Compute Nodes with the Cache-Aware Roofline Model
Published in IEEE transactions on parallel and distributed systems (01-06-2019)“…NUMA platforms, emerging memory architectures with on-package high bandwidth memories bring new opportunities and challenges to bridge the gap between…”
Get full text
Journal Article -
15
Experimenting task-based runtimes on a legacy Computational Fluid Dynamics code with unstructured meshes
Published in Computers & fluids (15-09-2018)“…•Discussion on porting a Legacy CFD code onto task-based runtime system.•Porting gradient reconstruction onto StarPU and PARSEC.•Comparing the PARSEC and…”
Get full text
Journal Article -
16
A methodology for assessing computation/communication overlap of MPI nonblocking collectives
Published in Concurrency and computation (10-10-2022)“…Summary By allowing computation/communication overlap, MPI nonblocking collectives (NBC) are supposed to improve application scalability and performance…”
Get full text
Journal Article -
17
H2M: Exploiting Heterogeneous Shared Memory Architectures
Published in Future generation computer systems (01-11-2023)“…Over the past decades, the performance gap between the memory subsystem and compute capabilities continued to spread. However, scientific applications and…”
Get full text
Journal Article -
18
Foreword to the Special Issue of the Twenty Sixth International Heterogeneity in Computing Workshop (HCW) and to the Fifteenth International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar)
Published in Concurrency and computation (2018)“…Heterogeneity is emerging as one of the most profound and challenging characteristics of today's parallel environments. As most modern computing systems are…”
Get full text
Journal Article -
19
READYS: A Reinforcement Learning Based Strategy for Heterogeneous Dynamic Scheduling
Published in 2021 IEEE International Conference on Cluster Computing (CLUSTER) (01-09-2021)“…In this paper, we propose READYS, a reinforcement learning algorithm for the dynamic scheduling of computations modeled as a Directed Acyclic Graph (DAGs). Our…”
Get full text
Conference Proceeding -
20
Topology-Aware Data Aggregation for Intensive I/O on Large-Scale Supercomputers
Published in 2016 First International Workshop on Communication Optimizations in HPC (COMHPC) (01-11-2016)“…Reading and writing data efficiently from storage systems is critical for high performance data-centric applications. These I/O systems are being increasingly…”
Get full text
Conference Proceeding