We'd previously been using a different tool to monitor the hardware status of our network devices but are in the process of migrating that function to Solarwinds NPM. Out of the ~1k nodes that we currently have onboarded into Solarwinds, just shy of 1/2 of those are showing an overall hardware status of "Node is Up. Overall Hardware Status (Node) 'Overall Hardware Status' has state: Unknown." All of the nodes showing an "Unknown" hardware status are showing valid CPU, memory and interface utilization information. The vendor name (Cisco) and model are also recognized.
I have read through a number of posts with similar topics and have tried all of the suggestions that I could find but nothing thus far seems to have helped. There doesn't seem to be a common theme among the devices that are showing in an unknown state. So far only deleting and re-adding the nodes to Solarwinds seems to resolve the issue (which isn't practical given the number of devices involved and the loss of historical data). Below are some of the things that I have tried.
- Rediscovered the device
- Repolled the device
- Verified that the SNMP community string passes the test in node settings
- Verified that the OID is viewable via the MIB browser
- Compared the SNMP configuration against known working devices of the same model & IOS version
- Compared the node configuration in Solarwinds against known working devices of the same model & IOS version
- Restarted services on all pollers
- Changed the Hardware Health Polling Method from "Use global setting" to "CISCO-ENVMON-MIB" (The default is set to CISCO-ENTITY-SENSOR-MIB) for the node
- Installed the most current set of MIBs from the Customer Portal (restarted services after install)
- Verified that the MSMQ directrory wasn't >1GB
- Deleted and re-added the node in Solarwinds <--- This is the only step that I have found which resolves the issue thus far.
Any help would be greatly appreciated!