We have a few hundred nodes that are spread out across the country and connected by ISPs that we do not directly manage. We would like to monitor both short term latency and long term latency (for trends).
For short term latency, based on 60 second polling intervals from the Orion engine, I'm using this SWQL:
SELECT TOP Caption, NodeID, AvgResponseTime, MinResponseTime, MaxResponseTime, PercentLoss
FROM Orion.Nodes
WHERE AvgResponseTime > 200 OR MaxResponseTime > 1000
ORDER BY AvgResponseTime DESC
That's fine for the last five or ten minutes, but I'd also like to be able to observe trends based on an hourly or daily basis. For example, if a node was showing an average response time of 250 ms yesterday but is now showing an average response time of 1500 ms, I need a way to flag that. Not necessarily an alert, but something to put in custom html widget would be fine.
We only have NPM.
Any ideas?