Running Orion 9.5 SP4 on a Windows Server 2003 VM with SP2. VM is a dual core, 2.6ghz server with 4 gig of ram. CPU averages 50%. Memory utilization is around 1.67gb.
I have a total of 4,708 Network Elements. 290 Nodes, 4,027 interfaces and 391 volumes. I've deleted out all of the interfaces that I can, i.e. user switch ports.
I am finding that my alerting is very slow. That's the first symptom anyway. I can shut down an interface and it can take it upwards of 7 minutes to get me an alert. I've verified that it's not just a slow e-mail system by going into Orion System Manager, Advanced Alerts, I continually refresh it after I shut down the interface and this is the time (7 minutes or so) before it shows up in the alerts.
For this particular interface, I have the status polling set to 45 seconds and statistics at 5 minutes. Everything else in my system is set to the default node polling interval of 120 seconds and 90 seconds for interfaces. However, I'm wanting to get that time down as low as possible to get alerts faster.
The alert I'm testing, I have it "Check this Alert every" set to 15 seconds. Once the alert shows up, I get my e-mail right after that, so I know that's working. My Trigger condition has the "Do not trigger this action until condition exists for more than" set to 0 seconds. No Alert Suppression is configured. Time of day is from 12:00 AM to 11:59 pm, 7 days a week.
Now, I say that alerting is the symptom but I think that it is not polling on a timely basis. I say this because when I look at the node and interface in question, either in the web interface or System Manager during this whole time after I have shut the interface down, it is showing it as up. I keep refreshing but it still shows it as up. It's not until right around when the alert happens that it finally shows it as down. So it seems that even though I have the polling interval set low, it's not polling as it should.
Any ideas?