Search Results - "Van der Wijngaart, Rob F"
1
Programming many-core architectures - a case study: dense matrix computations on the Intel single-chip cloud computer processor
Published in Concurrency and computation (25-08-2012)“…SUMMARY A message passing, distributed‐memory parallel computer on a chip is one possible design for future, many‐core architectures. We discuss initial…”
Journal Article -
2
Performance characteristics of the multi-zone NAS parallel benchmarks
Published in Journal of parallel and distributed computing (01-05-2006)“…We describe a new suite of computational benchmarks that models applications featuring multiple levels of parallelism. Such parallelism is often available in…”
Journal Article Conference Proceeding -
3
The 48-core SCC Processor: the Programmer's View
Published in 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (01-11-2010)“…The number of cores integrated onto a single die is expected to climb steadily in the foreseeable future. This move to many-core chips is driven by a need to…”
Conference Proceeding -
4
Evaluating Online Global Recovery with Fenix Using Application-Aware In-Memory Checkpointing Techniques
Published in 2016 45th International Conference on Parallel Processing Workshops (ICPPW) (01-08-2016)“…Exascale systems promise the potential for computation at unprecedented scales and resolutions, but achieving exascale by the end of this decade presents…”
Conference Proceeding -
5
The Parallel Research Kernels
Published in 2014 IEEE High Performance Extreme Computing Conference (HPEC) (01-09-2014)“…We present the Parallel Research Kernels; a collection of kernels supporting research on parallel computer systems. This set of kernels covers the most common…”
Conference Proceeding -
6
Design and Implementation of a Parallel Research Kernel for Assessing Dynamic Load-Balancing Capabilities
Published in 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01-05-2016)“…The Parallel Research Kernels (PRK) are a tool to study parallel architectures and runtime systems from an application perspective. It provides paper and…”
Conference Proceeding -
7
NAS Grid Benchmarks: A Tool for Grid Space Exploration
Published in Cluster computing (01-07-2002)“…We present a benchmark suite for computational Grids. It is based on the NAS Parallel Benchmarks (NPB) and is called NAS Grid Benchmark (NGB) in this paper. We…”
Journal Article -
8
Extending the BT NAS Parallel Benchmark to exascale computing
Published in 2012 International Conference for High Performance Computing, Networking, Storage and Analysis (01-11-2012)“…The NAS Parallel Benchmarks (NPB) are a well-known suite of benchmarks that proxy scientific computing applications. They specify several problem sizes that…”
Conference Proceeding -
9
Using the Parallel Research Kernels to Study PGAS Models
Published in 2015 9th International Conference on Partitioned Global Address Space Programming Models (01-09-2015)“…A subset of the Parallel Research Kernels (PRK), simplified parallel application patterns, are used to study the behavior of different runtimes implementing the…”
Conference Proceeding -
10
NAS Grid Benchmarks: a tool for Grid space exploration
Published in High Performance Distributed Computing: Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing; 07-09 Aug. 2001 (01-01-2001)“…We present a benchmark suite for computational grids in this paper. It is based on the NAS Parallel Benchmarks (NPB) and is called the NAS Grid Benchmark…”
Conference Proceeding Journal Article -
11
Analysis and Optimization of Software Pipeline Performance on MIMD Parallel Computers
Published in Journal of parallel and distributed computing (10-10-1996)“…Observations show that fine-grain software pipelines on MIMD parallel computers with asynchronous communication suffer from dynamic load imbalances which cause…”
Journal Article -
12
Minimizing Cache Misses in Scientific Computing Using Isoperimetric Bodies
Published 23-05-2002“…A number of known techniques for improving cache performance in scientific computations involve the reordering of the iteration space. Some of these…”
Journal Article -
13
Efficient cache use for stencil operations on structured discretization grids
Published 14-07-2000“…We derive tight bounds on cache misses for evaluation of explicit stencil operators on structured grids. Our lower bound is based on the isoperimetrical…”
Journal Article