ADAPTIVE DISTRIBUTED BREAKPOINT DETECTION AND CHECKPOINT SPACE REDUCTION IN MESSAGE PASSING PROGRAMS

Authors

  • Chyi Ren Dow
  • Cheng Min Lin

Abstract

Breakpoint setting is one of the fundamental mechanisms for debugging programs; however, detecting breakpoints in distributed programs is more difficult than in sequential programs. To identify program errors, the state of a distributed program must be rolled back to its earliest consistent global state after a breakpoint is detected. In this work, breakpoints are treated as checkpoints, so that techniques for finding the minimum consistent global checkpoint can be applied to find the earliest consistent global state. Four detection schemes are developed, one for each type of breakpoint: disjunctive, stable conjunctive, generic conjunctive, and unconditional. To reduce the checkpoint space, a typed checkpoint prevention scheme and a causal garbage collection scheme are also presented. Results from a variety of experiments demonstrate that combining the prevention and garbage collection techniques can reduce the checkpoint space to a reasonable size.
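
The idea of treating a breakpoint as a checkpoint and then searching for the minimum consistent global checkpoint that contains it can be illustrated with a small sketch. This is not the authors' detection scheme. It assumes that each local checkpoint is stamped with a vector clock, which the abstract does not state, and the names is_consistent and minimum_consistent_cut are hypothetical.

```python
from typing import Dict, List, Optional, Sequence

VectorClock = Sequence[int]


def is_consistent(cut: Sequence[VectorClock]) -> bool:
    """cut[i] is the vector clock of the checkpoint chosen for process i.

    The cut is consistent when no chosen checkpoint has seen more of
    process i's history than process i's own chosen checkpoint records.
    """
    n = len(cut)
    return all(cut[j][i] <= cut[i][i] for i in range(n) for j in range(n))


def minimum_consistent_cut(
    history: List[List[VectorClock]],
    fixed: Dict[int, int],
) -> Optional[List[int]]:
    """Return the earliest consistent cut that contains the fixed checkpoints.

    history[i] lists the checkpoint vector clocks of process i in local order
    (assumption: history[i][0] is an early checkpoint such as the initial state).
    fixed maps a process id to the index of a checkpoint that must be in the
    cut, for example the checkpoint taken where a breakpoint was detected.
    Returns one checkpoint index per process, or None if no such cut exists.
    """
    n = len(history)
    idx = [fixed.get(i, 0) for i in range(n)]
    changed = True
    while changed:
        changed = False
        for i in range(n):
            # Latest event of process i that any chosen checkpoint depends on.
            need = max(history[j][idx[j]][i] for j in range(n))
            while history[i][idx[i]][i] < need:
                if i in fixed or idx[i] + 1 == len(history[i]):
                    return None  # cannot advance far enough
                idx[i] += 1
                changed = True
    return idx


# Two processes; P1's checkpoint [2, 2] was taken after receiving a message
# that P0 sent after its checkpoint [2, 0].
history = [
    [[1, 0], [2, 0], [3, 1]],  # P0
    [[0, 1], [2, 2], [2, 3]],  # P1
]
cut = minimum_consistent_cut(history, {1: 1})  # breakpoint at P1, index 1
```

In this example the search advances P0 from its first checkpoint to its second, because the breakpoint checkpoint on P1 already depends on P0's second checkpoint. Advancing a process only when a dependency forces it is what keeps the resulting cut the earliest one.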

Published

2012-03-01

How to Cite

Dow, C. R., & Lin, C. M. (2012). ADAPTIVE DISTRIBUTED BREAKPOINT DETECTION AND CHECKPOINT SPACE REDUCTION IN MESSAGE PASSING PROGRAMS. Computing and Informatics, 19(6), 547–568. Retrieved from http://147.213.75.17/ojs/index.php/cai/article/view/577