Measuring the Performance and Reliability of Production Computational Grids

In this work we report on data gathered via a deployment of a monitoring and benchmarking infrastructure on two production grid platforms, TeraGrid and Geon. Our result show that these production grids are rather unavailable, with success rates for benchmark and application runs between 55% and 80%....

Full description

Saved in:
Bibliographic Details
Published in:2006 7th IEEE/ACM International Conference on Grid Computing pp. 293 - 300
Main Authors: Khalili, O., Jiahua He, Olschanowsky, C., Snavely, A., Casanova, H.
Format: Conference Proceeding
Language:English
Published: IEEE 01-09-2006
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this work we report on data gathered via a deployment of a monitoring and benchmarking infrastructure on two production grid platforms, TeraGrid and Geon. Our result show that these production grids are rather unavailable, with success rates for benchmark and application runs between 55% and 80%. We also found that performance fluctuation was in the 50% range, expectedly mostly due to batch schedulers. We also investigate whether the execution time of a typical grid application can be predicated based on previous runs of simple benchmarks. Perhaps surprisingly, we find that application execution time can be predicted with a relative error as low as 9%
ISBN:9781424403431
142440343X
ISSN:2152-1085
2152-1093
DOI:10.1109/ICGRID.2006.311028