Building Fault-tolerant Distributed Applications with Durra

``Building Fault-tolerant Distributed Applications with Durra'' by M. Barbacci and others. In Intl. Workshop on Configurable Dist. Systems, (London, England), Mar. 1992, pp. 128-139. Also in [swej93mar], pages 83-94.
Annotation: Durra describes an application as a set of components (application tasks and communication channels), a set of alternative configurations showing how these components are connected at runtime, and a set of conditional configuration transitions that take place at runtime. A ``cluster'' is a physical grouping of components at a node; a ``cluster manager'' is responsible for starting and terminating application processes and links, for passing messages between components, for monitoring reconfiguration conditions, and for carrying out reconfigurations. The Durra runtime requires that processes be quiescent before reconfiguring; Durra relies on processes to declare themselves quiescent explicitly by making a call to their cluster managers. If a process does not quiesce in a timely manner, the cluster manager times out. While Durra tolerates component failures, it does not tolerate cluster failures.

BibTeX entry:

   author = {M. Barbacci and others},
   title = {Building Fault-tolerant Distributed Applications with {Durra}},
   booktitle = {Intl. Workshop on Configurable Dist. Systems},
   pages = {128--139},
   address = {London, England},
   month = mar,
   year = {1992},
   note = {Also in [swej93mar], pages 83--94}

Also see software upgrades publications by date.

Sameer Ajmani