Using multirail networks in high-performance clusters

Using multiple independent networks (also known as rails) is an emerging technique which is being used to overcome bandwidth limitations and enhance fault tolerance of current high‐performance parallel computers. In this paper, we present and analyze various algorithms to allocate multiple communica...

Full description

Saved in:
Bibliographic Details
Published in:Concurrency and computation Vol. 15; no. 7-8; pp. 625 - 651
Main Authors: Coll, Salvador, Frachtenberg, Eitan, Petrini, Fabrizio, Hoisie, Adolfy, Gurvits, Leonid
Format: Journal Article
Language:English
Published: Chichester, UK John Wiley & Sons, Ltd 01-06-2003
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Using multiple independent networks (also known as rails) is an emerging technique which is being used to overcome bandwidth limitations and enhance fault tolerance of current high‐performance parallel computers. In this paper, we present and analyze various algorithms to allocate multiple communication rails, including static and dynamic allocation schemes. An analytical lower bound on the number of rails required for static rail allocation is shown. We also present an extensive experimental comparison of the behavior of various algorithms in terms of bandwidth and latency. We show that striping messages over multiple rails can substantially reduce network latency, depending on average message size, network load and allocation scheme. The methods compared include a static rail allocation, a basic round‐robin rail allocation, a local‐dynamic allocation based on local knowledge and a dynamic rail allocation that reserves both communication endpoints of a message before sending it. The last method is shown to perform better than the others at higher loads: up to 49% better than local‐knowledge allocation and 37% better than the round‐robin allocation. This allocation scheme also shows lower latency and it saturates at higher loads (for long enough messages). Most importantly, this proposed allocation scheme scales well with the number of rails and message size. In addition we propose a hybrid algorithm that combines the benefits of the local‐dynamic allocation for short messages with those of the dynamic algorithm for large messages. Copyright © 2003 John Wiley & Sons, Ltd.
Bibliography:ark:/67375/WNG-QBZQQ2L6-K
istex:82F723D76709CC524B16E34E36A24F45A471CDBA
U.S. Department of Energy - No. W-7405-ENG-36
ArticleID:CPE725
A shorter version of this paper and results for the static rail allocation can be found in [1] and [2]
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:1532-0626
1532-0634
DOI:10.1002/cpe.725