Hi all, I'm trying to configure some alerts on my servers but I'm facing some difficulty on configuring it based on the requirement my company wants. E.g.
Alerts to send email if server CPU reaches more than 70%. Alerts engineer every 4 hours, server CPU must stay more than 70% for more than 1minute.
Based those information, I set as below
Evaluation Frequency of alert - 4hrs
Select options "Condition must exist for more than" - 1mins
By right it should work correct?
However after several days I notice some problem, none of my engineer receive any alerts even though the server CPU reaches more than 70% I run checks on the historical utilization and indeed there's several times where the CPU hit 80%-90% for more than 10mins but no alerts were triggered. Anyone knows why is that so? I'm suspect this is due to the option "Evaluation Frequency of alert - 4hrs" cause SW could be scanning for this alerts every 4 hours, so the issue occur and ended less than 4 hours then SW will not capture it.
Is this true? If yes then any other way I could configure the alert based on my requirements provided SW must be actively scanning the servers for all this alerts.