TY  - CONF
T1  - Fault injection based on a partial view of the global state of a distributed system
Y1  - 1999
A1  - Cukier, Michel
A1  - Chandra, R.
A1  - Henke, D.
A1  - Pistole, J.
A1  - Sanders, W. H.
KW  - bounding technique
KW  - clock synchronization
KW  - distributed programming
KW  - distributed software systems
KW  - fault injection
KW  - Loki
KW  - post-runtime analysis
KW  - program testing
KW  - program verification
KW  - software reliability
KW  - Synchronisation
AB  - This paper describes the basis for and preliminary implementation of a new fault injector, called Loki, developed specifically for distributed systems. Loki addresses issues related to injecting correlated faults in distributed systems. In Loki, fault injection is performed based on a partial view of the global state of an application. In particular, facilities are provided to pass user-specified state information between nodes to provide a partial view of the global state in order to try to inject complex faults successfully. A post-runtime analysis, using an off-line clock synchronization and a bounding technique, is used to place events and injections on a single global time-line and determine whether the intended faults were properly injected. Finally, observations containing successful fault injections are used to estimate specified dependability measures. In addition to describing the details of our new approach, we present experimental results obtained from a preliminary implementation in order to illustrate Loki's ability to inject complex faults predictably.
M3  - 10.1109/RELDIS.1999.805093
ER  - 