Search Results - "Hacker, T. J."
-
1
An analysis of clustered failures on large supercomputing systems
Published in Journal of parallel and distributed computing (01-07-2009)“…Large supercomputers are built today using thousands of commodity components, and suffer from poor reliability due to frequent component failures. The…”
Get full text
Journal Article -
2
The end-to-end performance effects of parallel TCP sockets on a lossy wide-area network
Published in Proceedings 16th International Parallel and Distributed Processing Symposium (2002)“…This paper examines the effects of using parallel TCP flows to improve end-to-end network performance for distributed data intensive applications. A series of…”
Get full text
Conference Proceeding -
3
Live Migration of Parallel Applications with OpenVZ
Published in 2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications (01-03-2011)“…A parallel application can terminate or produce incorrect results when a computational node fails. As the number of components in large scale supercomputing…”
Get full text
Conference Proceeding -
4
Improving throughput and maintaining fairness using parallel TCP
Published in IEEE INFOCOM 2004 (2004)“…Applications that require good network performance often use parallel TCP streams and TCP modifications to improve the effectiveness of TCP. If the network…”
Get full text
Conference Proceeding -
5
Adaptive data block scheduling for parallel TCP streams
Published in HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005 (2005)“…Applications that use parallel TCP streams to increase throughput must multiplex and demultiplex data blocks over a set of TCP streams transmitting on one or…”
Get full text
Conference Proceeding -
6
Kernel Level Support for Workflow Patterns
Published in 2011 IEEE World Congress on Services (01-07-2011)“…In the evolution of computing technology over the decades, file system capabilities have not grown in tandem to processing power. Today, scientific computing…”
Get full text
Conference Proceeding -
7
Walden: A Scalable Solution for Grid Account Management
Published in Fifth IEEE/ACM International Workshop on Grid Computing (08-11-2004)“…A large and diverse consortium of grid clusters, as can be found in a university setting, requires a flexible authorization model that is scalable, extensible…”
Get full text
Conference Proceeding -
8
Accounting and Accountability for Distributed and Grid Systems
Published in 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02) (2002)Get full text
Conference Proceeding