Hi All,
We've recently (about 1 month ago) purchased Orion NPM and have been very happy with it to date. I have setup the node UP/DOWN alerts for all our monitored nodes. When a node goes up / down our NOC will receive an email alert. This has been implemented for the last week and has been running in parallel with Statseeker. We bought Orion NPM to replace statseeker.
However, overnight I had received a number of emails from Statseeker saying that certain nodes had gone down and some had come back up, but I had no emails from Orion NPM. This is a concern. Had it not been for Statseeker our comms team would have had no idea there were issues in some of our overseas offices.
I logged into our web console and sure enough NPM was showing 4 nodes that were down, yet I had received no alert notification from NPM. I thought perhaps it was an email with the mail server (NPM and statseeker use the same one). I then tried a test fire of an alert, but did not receive an email and I also noticed that I got no output in the "Test Fire Alert" screen. I then ran up the Orion Service Manager and checked the status of all processes. Everything was running. I then restarted the SolarWinds Alerting Engine and as soon as it restarted I received a flood of emails containing the up/down alerts that I missed and I also noticed the normal output in the test fire window.
For those of you who have been using NPM for a while have you experienced these problems before? Does it happen frequently? I am worried that there is some sort of bug with the Alerting service.
Thanks