Theses and Dissertations
Issuing Body
Mississippi State University
Advisor
Skjellum, Anthony
Committee Member
Reese, Donna S.
Committee Member
Little, Rainey
Date of Degree
5-10-2003
Document Type
Graduate Thesis - Open Access
Major
Computer Science
Degree Name
Master of Science
College
James Worth Bagley College of Engineering
Department
Department of Computer Science
Abstract
The popularity of MPI-based middleware and applications has led to their wide deployment. Such systems, however, are not inherently reliable and cannot tolerate external faults. This thesis presents a novel model-based approach for exploiting application features and other characteristics to categorize and create AEMs (Application Execution Model). This work realizes MPI/FT(tm), a middleware derived by selective incorporation of fault-tolerant features into MPI/Pro(tm) for two relevant AEMs. This thesis proves the following hypothesis: it is possible to successfully complete select MPI applications even in the presence of external faults, and such fault-tolerance can be achieved with acceptable performance overhead. This work defines parameters to measure the impact of this middleware on performance through faultree and fault-injected overheads. The hypothesis is validated through experimentation and measurement of sample MPI applications for two AEMs.
URI
https://hdl.handle.net/11668/18395
Recommended Citation
Batchu, Rajanikanth Reddy, "Incorporating Fault-Tolerant Features into Message-Passing Middleware" (2003). Theses and Dissertations. 2679.
https://scholarsjunction.msstate.edu/td/2679