Use of Self-Healing Techniques for Highly-Available Distributed Monitoring

Authors

  • Włodzimierz Funika, Faculty of Computer Science, Electronics and Telecommunication, AGH University of Science and Technology, 30-059 Krakow

Keywords:

Monitoring, self-healing, distributed systems, reliability, high availability

Abstract

This paper addresses the self-healing aspects of monitoring systems. For complex distributed systems, a monitoring system should be "intelligent": as a first step, it can guide the user on what should be monitored. The next level of "intelligence" can be described by the term "self-healing". The goal is for decisions made automatically by the monitoring system to make the system under monitoring behave in a more stable, reliable and predictable way. The paper presents a new monitoring system, AgeMon, an agent-based distributed monitoring system with strictly defined roles that the agents can perform. We discuss self-healing in the context of monitoring. As for self-healing of the monitoring system itself, a good example is the case where monitoring data could be lost due to storage problems. AgeMon can handle such problems by automatically electing substitute persistence agents to store the data.

Author Biography

Włodzimierz Funika, Faculty of Computer Science, Electronics and Telecommunication, AGH University of Science and Technology, 30-059 Krakow

Department of Computer Science

Published

2018-07-03

How to Cite

Funika, W. (2018). Use of Self-Healing Techniques for Highly-Available Distributed Monitoring. Computing and Informatics, 37(2), 424–456. Retrieved from http://147.213.75.17/ojs/index.php/cai/article/view/2018_2_424