Scalability of Network-Failure Resilience: Analysis Using Multi-Layer Probabilistic Graphical Models

In this work, we quantify scalability of network resilience upon failures. We characterize resilience as the percentage of lost traffic upon failures and define scalability as the growth rate of the percentage of lost traffic with respect to network size, link failure probability, and network traffi...

Full description

Saved in:
Bibliographic Details
Published in:IEEE/ACM transactions on networking Vol. 17; no. 1; pp. 319 - 331
Main Authors: Guanglei Liu, Guanglei Liu, Chuanyi Ji, Chuanyi Ji
Format: Journal Article
Language:English
Published: New York IEEE 01-02-2009
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this work, we quantify scalability of network resilience upon failures. We characterize resilience as the percentage of lost traffic upon failures and define scalability as the growth rate of the percentage of lost traffic with respect to network size, link failure probability, and network traffic for given failure protection schemes. We apply probabilistic graphical models to characterize statistical dependence between physical-layer failures and the network-layer traffic, and analyze the scalability for large networks of different topologies. We first focus on the scalability of resilience for regular topologies under uniform deterministic traffic with independent and dependent link failures, with and without protection. For large networks with small probabilities of failures and without protection, we show that the scalability of network resilience grows linearly with the average route length and with the "effective" link failure probability. For large networks with 1 + 1 protection, we obtain lower and upper bound of the percentage of lost traffic. We derive approximations of the scalability for arbitrary topologies, and attain close-form analytical results for ring, star, and mesh-torus topologies. We then study network resilience under random traffic with Poisson arrivals. We find that when the network is under light load, the network resilience is reduced to that under uniform deterministic traffic. When the network load is under heavy load, the percentage of lost traffic approaches the marginal probability of link failure. Our scalability analysis shows explicitly how network resilience varies with different factors and provides insights for resilient network design.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:1063-6692
1558-2566
DOI:10.1109/TNET.2008.925944