I configured an alert to basically tell me if a server is up or down with the following trigger:
Node category is equal to server
node percent loss is greater than 40%
I set the alert actions to send me an email first and then send an SMS through OpsGenie 5 minutes later if it is still down.
The alert works beautifully. If I shut a test server down it will cycle through all the correct steps. Once I bring the server back up it sends me a reset email. Afterwards thats when things go haywire.
The alert begins flapping sending me up and down messages for the next 5 minutes until it finally stops. Is there something I can add to the trigger to prevent this?