Availability has become a key requirement for many modern IT systems. Computer operating systems play an impor- tant role in IT systems availability. Due to the complexity of their architecture, they are prone to suffer failures due to several types of software faults. Software aging causes a non-negligible fraction of these failures. It leads to an accumulation of errors with time, increasing the system failure rate. This phenomenon can be accompanied by performance degradation and eventually system hang or even crash. As a countermeasure, software rejuvenation entails stopping the system, cleaning up its internal state, and resuming its operation. This process usually incurs downtime. For an operating system, the downtime impacts any application running on top of it. Several solutions have been developed to speed up the boot time of operating systems in order to reduce the downtime overhead. We present a study of two fast OS reboot techniques for rejuvenation of Linux-based operating systems, namely Kexec and Phase-based reboot. The study measures the performance penalty they introduce and the gain in reduction of downtime overhead. The results reveal that the Kexec and Phase-based reboot have no statistically significant impact in terms of per- formance penalty from the user perspective. However, they may require extra resource (e.g., CPU) usage. The downtime overhead reduction, compared with normal Linux and VM reboots, is 77% and 79% in Kexec and Phase-based reboot, respectively.

Towards Fast OS Rejuvenation: an Experimental Evaluation of Fast OS Reboot Techniques / Bovenzi, Antonio; J., Alonso; H., Yamada; Russo, Stefano; K. S., Trivedi. - (2013), pp. 61-70. (Intervento presentato al convegno 2013 IEEE 24th International Symposium on Software Reliability Engineering (ISSRE 2013) tenutosi a Pasadena, CA (USA) nel 4-7 Novembre 2013) [10.1109/ISSRE.2013.6698905].

Towards Fast OS Rejuvenation: an Experimental Evaluation of Fast OS Reboot Techniques

BOVENZI, ANTONIO;RUSSO, STEFANO;
2013

Abstract

Availability has become a key requirement for many modern IT systems. Computer operating systems play an impor- tant role in IT systems availability. Due to the complexity of their architecture, they are prone to suffer failures due to several types of software faults. Software aging causes a non-negligible fraction of these failures. It leads to an accumulation of errors with time, increasing the system failure rate. This phenomenon can be accompanied by performance degradation and eventually system hang or even crash. As a countermeasure, software rejuvenation entails stopping the system, cleaning up its internal state, and resuming its operation. This process usually incurs downtime. For an operating system, the downtime impacts any application running on top of it. Several solutions have been developed to speed up the boot time of operating systems in order to reduce the downtime overhead. We present a study of two fast OS reboot techniques for rejuvenation of Linux-based operating systems, namely Kexec and Phase-based reboot. The study measures the performance penalty they introduce and the gain in reduction of downtime overhead. The results reveal that the Kexec and Phase-based reboot have no statistically significant impact in terms of per- formance penalty from the user perspective. However, they may require extra resource (e.g., CPU) usage. The downtime overhead reduction, compared with normal Linux and VM reboots, is 77% and 79% in Kexec and Phase-based reboot, respectively.
2013
9781479923663
Towards Fast OS Rejuvenation: an Experimental Evaluation of Fast OS Reboot Techniques / Bovenzi, Antonio; J., Alonso; H., Yamada; Russo, Stefano; K. S., Trivedi. - (2013), pp. 61-70. (Intervento presentato al convegno 2013 IEEE 24th International Symposium on Software Reliability Engineering (ISSRE 2013) tenutosi a Pasadena, CA (USA) nel 4-7 Novembre 2013) [10.1109/ISSRE.2013.6698905].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/561959
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 24
  • ???jsp.display-item.citation.isi??? 17
social impact