Fehlertoleranz in einem verteilten Betriebssystem mit transaktionalem Speicher
Dissertation
Faculties
Fakultät für Ingenieurwissenschaften und InformatikAbstract
Creating checkpoints of a distributed cluster operating system is a non-trivial task, as special coordination between cluster nodes is necessary to cope with the domino effect. This additional coordination effort of distributed checkpointing can be greatly reduced by using distributed transactional memory as communication medium between cluster nodes. Based on Rainbow OS, a 64 bit cluster operating system, this work presents an approach to efficient distributed checkpointing which can be executed with minimal impact and concurrent to cluster operations. Checkpointing data is saved on solid state drive (SSD) storage using a specifically tailored algorithm to maximize checkpointing performance and still provide object-oriented access. This work further offers an approach to include device driver data in checkpoints and to consistently restore it in case of system failure with subsequent fallback to a stored checkpoint.
Date created
2012
Subject headings
[GND]: Fehlertoleranz | Verteiltes Betriebssystem[LCSH]: Electronic data processing; Distributed processing | Fault-tolerant computing | Operating systems (Computers)
[Free subject headings]: Checkpointing | Distributed computing | Transactional memory
[DDC subject group]: DDC 004 / Data processing & computer science
Metadata
Show full item recordDOI & citation
Please use this identifier to cite or link to this item: http://dx.doi.org/10.18725/OPARU-2459
Gerhold, Steffen (2013): Fehlertoleranz in einem verteilten Betriebssystem mit transaktionalem Speicher. Open Access Repositorium der Universität Ulm und Technischen Hochschule Ulm. Dissertation. http://dx.doi.org/10.18725/OPARU-2459
Citation formatter >