I have some alerts that look at CPU and RAM every minute and send an email if CPU or RAM has been above 90% for 5 minutes.
But I notice the polling interval for statistics is 10 minutes.
So I'm trying to understand if the alert is using information gathered from the 10 minute polling interval for node statistics, and if that is the case, then it doesn't make a lot of sense for me to have an alert checking every minute.
Basically, I'm trying to give the Business some idea of how accurate the alerts are.
What we're trying to alert on is when the CPU or RAM has been sitting at above 90% (critical threshold) for over 5 minutes but I am not knowledgeable enough about the product to understand whether the alert is working the way we expect it to, assuming it is pulling data gathered from the node statistics polling interval.