Search Results - "Thibault, Samuel"
-
1
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model
Published in IEEE transactions on parallel and distributed systems (18-12-2017)“…The emergence of accelerators as standard computing resources on supercomputers and the subsequent architectural complexity increase revived the need for…”
Get full text
Journal Article -
2
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures
Published in Concurrency and computation (01-02-2011)“…In the field of HPC, the current hardware trend is to design multiprocessor architectures featuring heterogeneous technologies such as specialized coprocessors…”
Get full text
Journal Article -
3
Taming data locality for task scheduling under memory constraint in runtime systems
Published in Future generation computer systems (01-06-2023)“…A now-classical way of meeting the increasing demand for computing speed by HPC applications is the use of GPUs and/or other accelerators. Such accelerators…”
Get full text
Journal Article -
4
hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications
Published in 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing (01-02-2010)“…The increasing numbers of cores, shared caches and memory nodes within machines introduces a complex hardware topology. High-performance computing applications…”
Get full text
Conference Proceeding -
5
Revisiting dynamic DAG scheduling under memory constraints for shared-memory platforms
Published in 2020 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (01-05-2020)“…This work focuses on dynamic DAG scheduling under memory constraints. We target a shared-memory platform equipped with p parallel processors. We aim at…”
Get full text
Conference Proceeding -
6
Tracing task‐based runtime systems: Feedbacks from the StarPU case
Published in Concurrency and computation (01-02-2024)“…Summary Given the complexity of current supercomputers and applications, being able to trace application executions to understand their behavior is not a…”
Get full text
Journal Article -
7
Programming heterogeneous architectures using hierarchical tasks
Published in Concurrency and computation (15-11-2023)“…Summary Task‐based systems have become popular due to their ability to utilize the computational power of complex heterogeneous systems. A typical programming…”
Get full text
Journal Article -
8
EXA2PRO: A Framework for High Development Productivity on Heterogeneous Computing Systems
Published in IEEE transactions on parallel and distributed systems (01-04-2022)“…Programming upcoming exascale computing systems is expected to be a major challenge. New programming models are required to improve programmability, by hiding…”
Get full text
Journal Article -
9
Faithful performance prediction of a dynamic task-based runtime system for heterogeneous multi-core architectures
Published in Concurrency and computation (01-11-2015)“…Summary Multi‐core architectures comprising several graphics processing units (GPUs) have become mainstream in the field of high‐performance computing…”
Get full text
Journal Article -
10
A visual performance analysis framework for task‐based parallel applications running on hybrid clusters
Published in Concurrency and computation (25-09-2018)“…Summary Programming paradigms in High‐Performance Computing have been shifting toward task‐based models that are capable of adapting readily to heterogeneous…”
Get full text
Journal Article -
11
Structuring the execution of OpenMP applications for multicore architectures
Published in 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) (01-04-2010)“…The now commonplace multi-core chips have introduced, by design, a deep hierarchy of memory and cache banks within parallel computers as a tradeoff between the…”
Get full text
Conference Proceeding -
12
Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach
Published in Microprocessors and microsystems (01-11-2022)“…In the near future, Exascale systems will need to bridge three technology gaps to achieve high performance while remaining under tight power constraints:…”
Get full text
Journal Article -
13
Resiliency in numerical algorithm design for extreme scale simulations
Published in The international journal of high performance computing applications (01-03-2022)“…This work is based on the seminar titled ‘Resiliency in Numerical Algorithm Design for Extreme Scale Simulations’ held March 1–6, 2020, at Schloss Dagstuhl,…”
Get full text
Journal Article -
14
MASA-StarPU: Parallel Sequence Comparison with Multiple Scheduling Policies and Pruning
Published in 2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) (01-09-2020)“…Sequence comparison tools based on the Smith-Waterman (SW) algorithm provide the optimal result but have high execution times when the sequences compared are…”
Get full text
Conference Proceeding -
15
From tasks graphs to asynchronous distributed checkpointing with local restart
Published in 2020 IEEE/ACM 10th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS) (01-11-2020)“…The ever-increasing number of computation units assembled in current HPC platforms leads to a concerning increase in fault probability. Traditional…”
Get full text
Conference Proceeding -
16
List Scheduling in Embedded Systems Under Memory Constraints
Published in International journal of parallel programming (01-12-2015)“…Video decoding and image processing in embedded systems are subject to strong resource constraints, particularly in terms of memory. List-scheduling heuristics…”
Get full text
Journal Article -
17
Data-Aware Task Scheduling on Multi-accelerator Based Platforms
Published in 2010 IEEE 16th International Conference on Parallel and Distributed Systems (01-12-2010)“…To fully tap into the potential of heterogeneous machines composed of multicore processors and multiple accelerators, simple offloading approaches in which the…”
Get full text
Conference Proceeding -
18
Asynchronous Task-Based Execution of the Reverse Time Migration for the Oil and Gas Industry
Published in 2019 IEEE International Conference on Cluster Computing (CLUSTER) (01-09-2019)“…We propose a new framework for deploying Reverse Time Migration (RTM) simulations on distributed-memory systems equipped with multiple GPUs. Our software,…”
Get full text
Conference Proceeding -
19
Data-Driven Locality-Aware Batch Scheduling
Published in 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (27-05-2024)“…Clusters employ workload schedulers such as the Slurm Workload Manager to allocate computing jobs onto nodes. These schedulers usually aim at a good tradeoff…”
Get full text
Conference Proceeding -
20
Memory-Aware Scheduling of Tasks Sharing Data on Multiple GPUs with Dynamic Runtime Systems
Published in 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01-05-2022)“…The use of accelerators such as GPUs has become mainstream to achieve high performance on modern computing systems. GPUs come with their own (limited) memory…”
Get full text
Conference Proceeding