Afternoon all,
I have an issue with NPM, We're running APM and NCM along side.
We have got an unlimited licence for the products, and are currently running in evaulation before registering them until we can get this bug worked out.
The server is a HP DL380 G6 with 8 physical and 8 Hyper Threaded processors, Running 24GB of RAM.
Operating system is on 2x 300GB DP SAS in raid 1
There is a 300GB Page File on 2x 300GB DP SAS in raid 1
The SolarWinds data is stored on a 3x 300GB DP SAS in raid 5.
Both the SQL DB and the IIS data is stored on the raid 5.
We are using SQL 2008 R2 Express which gives us 10GB of usable space over the 4GB from 2003. We plan to migrate to a full SQL 2008 R2 install when we get close to the limit.
The DB is currently 2.5GB in size, and we are currently monitoring:
Network Elements : 2059
Nodes : 254
Interfaces : 1767
Volumes : 38
We are running a single polling engine, Which is currently at 43%.
We are monitoring Cisco routers and switches via SNMP V2c, 3 HP SAN's via SNMP V2c, and i'm starting to add Windows Servers (2003 and 2008 R2) via WMI.
Polling Settings:
Default Node Poll Interval : 30 seconds
Default Interface Poll Interval : 30 seconds
Default Volume Poll Interval : 120 seconds
Default Rediscovery Interval : 15 minutes
Default Node Statistics Poll Interval : 10 minutes
Default Interface Statistics Poll Interval : 10 minutes
Default Volume Statistics Poll Interval : 15 minutes
The server is averaging around 10% cpu utilisation across all 16 cores, and around 6GB of RAM usage.
We have 10 users of SolarWinds, and 4 large TV's showing various maps and data.
We are having issues with horrifically slow refresh rates for the users pulling in node details.
One screen is pulling the Message Centre page, and is taking approximately 15-20 seconds to refresh most of the time, the map screens are taking around 40-50 seconds to refresh dependant on network status.
I turned on caching for aspx files in the IIS manager. This caused issues with the maps, Due to them being on a single file i couldn't exclude from the cache. However this sped up the reload time for pages considerably, taking around 1-2 seconds to load, which is where i would expect it to be given the performance of the hardware SolarWinds has to run on.
I also increased the IIS worker threads, to 16 from 1. This again caused issues with the maps where they complained if it wasn't set to 1!
I'm starting to get questions now if we spent £50,000 on something that is unable to perform how we want.
We are getting a further 4 staff, and the usage of SolarWinds will continue to get heavier.