Search Results - "SAAVEDRA, R. H"
-
1
Measuring cache and TLB performance and their effect on benchmark runtimes
Published in IEEE transactions on computers (01-10-1995)“…In previous research, we have developed and presented a model for measuring machines and analyzing programs, and for accurately predicting the running time of…”
Get full text
Journal Article -
2
Performance characterization of optimizing compilers
Published in IEEE transactions on software engineering (01-07-1995)“…Optimizing compilers have become an essential component in achieving high levels of performance. Various simple and sophisticated optimizations are implemented…”
Get full text
Journal Article -
3
Adaptive software prefetching in scalable multiprocessors using cache information
Published in Parallel computing (01-08-2001)“…Scalable multiprocessors present special challenges to static software prefetching because on these systems the memory access latency is not completely…”
Get full text
Journal Article -
4
Micro benchmark analysis of the KSR1
Published in Conference on High Performance Networking and Computing: Proceedings of the 1993 ACM/IEEE conference on Supercomputing (01-12-1993)“…The micro benchmark approach is used to analyze the KSR1 and, in particular, the ALLCACHE memory architecture and ring interconnection. The authors have been…”
Get full text
Conference Proceeding -
5
Adaptive granularity : Transparent integration of fine- and coarse-grain communication
Published in International journal of parallel programming (01-10-1997)“…The granularity of shared data is one of the key factors affecting the performance of distributed shared memory machines (DSM). Given that programs exhibit…”
Get full text
Journal Article -
6
Machine characterization based on an abstract high-level language machine
Published in IEEE transactions on computers (01-12-1989)“…Measurements are presented for a large number of machines ranging from small workstations to supercomputers. The authors combine these measurements into groups…”
Get full text
Journal Article -
7
The limits and effectiveness of data prefetching on scalable multiprocessors
Published in Performance evaluation (01-10-1996)Get full text
Journal Article -
8
Adaptive granularity: transparent integration of fine- and coarse-grain communication
Published in Proceedings of the 1996 Conference on Parallel Architectures and Compilation Technique (1996)“…The granularity of shared data is one of the key factors affecting the performance of distributed shared memory machines (DSM). Given that programs exhibit…”
Get full text
Conference Proceeding -
9
Trojan: a high-performance simulator for shared memory architectures
Published in Proceedings of the 29th Annual Simulation Symposium (1996)“…The paper presents an execution driven simulator called Trojan, which is an extended version of MIT Proteus, for evaluating the performance of parallel shared…”
Get full text
Conference Proceeding -
10
The combined effectiveness of unimodular transformations, tiling, and software prefetching
Published in Proceedings of International Conference on Parallel Processing (1996)“…Unimodular transformations, tiling, and software prefetching are loop optimizations known to be effective in increasing parallelism, reducing cache miss rates,…”
Get full text
Conference Proceeding