Fault Models for Engineering

01.04.2008 - 31.03.2010
Forschungsförderungsprojekt
As (networked embedded) distributed systems become more and more prevalent in our daily lives, they must meet stringent requirements on their reliability and safety. Such demands can only be met with suitable fault tolerance mechanisms, which led to the emergence of a large and active scientific community concerned with fault tolerance in distributed systems. At the same time, an even larger and equally active, albeit disjoint, engineering community is concerned with implementing fault tolerance mechanisms. Unfortunately, at the moment there exists a fairly large gap between the fault models employed by the scientific community and those required by system engineers: The former talk about abstract notions like omission failures, the latter work with tangible metrics like mean time to failure of specific components. Scientists investigate fault resilience, e.g., "algorithm A requires n > 2t nodes to tolerate t faults", engineers think in terms of system coverage, e.g., "if algorithm A is executed in system S, then the probability of failure is at most p=10^-3". This makes taking over new scientific results into engineering solutions difficult and thus a large potential for improvements is lost. The project Fault Models for Engineering (FAME) is therefore dedicated to the development of more realistic fault models for distributed systems and accompanying translation tools that can directly be used by system engineers to determine the fault coverage in an actual system. FAME thus aims at providing the missing link between the two communities, first by identifying more appropriate scientific fault models, and second by exploring how such fault models can be reconciled with the coverage information engineers need.

Personen

Projektleiter_in

Institut

Grant funds

  • FFG - Österr. Forschungsförderungs- gesellschaft mbH (National) Austrian Research Promotion Agency (FFG)

Forschungsschwerpunkte

  • Computer Engineering: 100%

Schlagwörter

DeutschEnglisch
(Eingebettete) Verteilte Systeme(Embedded) Distributed Systems
FehlermodelleFault Models

Publikationen