Search Results - "Du Bois, Kristof"
-
1
The Intel® Programmable and Integrated Unified Memory Architecture (PIUMA) Graph Analytics Processor
Published in IEEE MICRO (01-09-2023)“…High performance large scale graph analytics are essential to timely analyze relationships in big data sets. Conventional processor architectures suffer from…”
Get full text
Journal Article -
2
RIO: ROB-Centric In-Order Modeling of Out-of-Order Processors
Published in IEEE computer architecture letters (01-01-2021)“…Architectural studies of the cache and memory hierarchy need a fast simulation model for the processor core that accurately conveys the impact of memory…”
Get full text
Journal Article -
3
Multi-Stage CPI Stacks
Published in IEEE computer architecture letters (01-01-2018)“…CPI stacks are an intuitive way to visualize processor core performance bottlenecks. However, they often do not provide a full view on all bottlenecks, because…”
Get full text
Journal Article -
4
Many-Core Graph Workload Analysis
Published in SC18: International Conference for High Performance Computing, Networking, Storage and Analysis (01-11-2018)“…Graph applications have specific characteristics that are not common in other application domains and therefore require thorough analysis to guide future graph…”
Get full text
Conference Proceeding -
5
Accurate and Scalable Many-Node Simulation
Published 18-01-2024“…Accurate performance estimation of future many-node machines is challenging because it requires detailed simulation models of both node and network. However,…”
Get full text
Journal Article -
6
Projecting Performance for PIUMA using Down-Scaled Simulation
Published in 2020 IEEE High Performance Extreme Computing Conference (HPEC) (22-09-2020)“…Programmable Integrated Unified Memory Architecture (PIUMA) is Intel's novel graph analysis optimized processor architecture, targeted at efficiently executing…”
Get full text
Conference Proceeding -
7
Extending the Performance Analysis Tool Box: Multi-stage CPI Stacks and FLOPS Stacks
Published in 2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (01-04-2018)“…CPI stacks are an intuitive way to visualize processor core performance bottlenecks. However, they often do not provide a full view on all bottlenecks, because…”
Get full text
Conference Proceeding -
8
Analyzing the scalability of managed language applications with speedup stacks
Published in 2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (01-04-2017)“…Understanding the reasons why multi-threaded applications do not achieve perfect scaling on modern multicore hardware is challenging. Furthermore, more and…”
Get full text
Conference Proceeding -
9
Speedup stacks: Identifying scaling bottlenecks in multi-threaded applications
Published in 2012 IEEE International Symposium on Performance Analysis of Systems & Software (01-04-2012)“…Multi-threaded workloads typically show sublinear speedup on multi-core hardware, i.e., the achieved speedup is not proportional to the number of cores and…”
Get full text
Conference Proceeding -
10
PIUMA: Programmable Integrated Unified Memory Architecture
Published 13-10-2020“…High performance large scale graph analytics is essential to timely analyze relationships in big data sets. Conventional processor architectures suffer from…”
Get full text
Journal Article