I have set up an alert that should notify me if any of the following conditions is true:
CPU greater than 80%
Memory greater than 80%
Volume free space less than 20%
Node status is down
The condition must exist for 5 minutes before the alert triggers.
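To make the rule concrete, here is a rough Python sketch of how I understand the alert logic. This is not the monitoring tool's own API; `Sample`, `breached`, and `should_alert` are hypothetical names I made up for illustration, and I am assuming samples arrive ordered by time.

```python
from dataclasses import dataclass

SUSTAIN_SECONDS = 5 * 60  # the condition must exist for 5 minutes


@dataclass
class Sample:
    """One polling result for a node (hypothetical structure)."""
    timestamp: float        # seconds
    cpu_pct: float
    memory_pct: float
    volume_free_pct: float
    node_up: bool


def breached(s: Sample) -> bool:
    """True if any one of the four alert conditions holds for this sample."""
    return (
        s.cpu_pct > 80
        or s.memory_pct > 80
        or s.volume_free_pct < 20
        or not s.node_up
    )


def should_alert(samples: list[Sample]) -> bool:
    """True if any unbroken run of breaching samples lasts at least 5 minutes."""
    breach_start = None
    for s in samples:  # assumed to be sorted by timestamp
        if breached(s):
            if breach_start is None:
                breach_start = s.timestamp
            if s.timestamp - breach_start >= SUSTAIN_SECONDS:
                return True
        else:
            # A healthy sample resets the 5-minute window.
            breach_start = None
    return False
```

With this reading, a node reporting low CPU and memory but a status of down would still trigger the alert once it has been down for 5 minutes, because only one condition needs to hold.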
I did receive an alert when a node triggered one of these conditions.
I received an email notification saying that a problem was detected on a particular node, for example:
Memory: 10%
CPU load: 5%
Node status: down
I have a couple of questions:
How long was this node down?
Did it stay down, or did it come back up after the 5 minutes?
Is there any way to determine these things? Also, how can we send an email notification once the node is up again?
Could someone assist with this?
Thanks in advance