I'm getting events from some old slow devices, for example:
Node XXXXX has dropped its average response time from above 200ms to 62 ms which falls below the 100ms threshold.
Where this 100ms threshold comes? I've adjusted alert manager to alert only when response time drops to over 300ms, but that doesn't help. I'm still getting these 100ms threshold notifications, which is quite useless.
I've also adjusted Orion thresholds to 1000ms from web console but that doesn't help either.
What is the right place to adjust these things? It also would be great if I could adjust some devices to lower threshold. For example LAN swithes normal response time is 5ms, so 1000 is way too high. It wouldn't tell anything, if adjusted like that. But in the other hand, I wouldn't like to have alerts/events from every WAN devices if threshold is adjusted to 100 for example. The point is, I'd like to have events/alerts only if something is really wrong. If I'm getting tens of useless events all the time, I probably miss some real problems.