Hello...
We have started monitoring and managing a number of Cisco Nexus 7010 devices in our network using Solarwinds Orion NPM. On the whole these devices are able to be managed like any other Cisco switch or router device, however there is one particular issue that we are coming across which is rather ugly.
When we reload or suspend/unususpend one or more of the virtual routers (VDCs) in a Cisco Nexus 7010 device, sometimes Orion NPM stops monitoring the device correctly. Typically this involves one or more interfaces going into an unknown state, or the status of the interfaces are not updated when all the interfaces come up again after a reload. From the Node Details page we have tried doing a 'Poll Now', 'Rediscover' or an unmanage and then remanage of the node. Some of these actions sometimes appear to work but at other times none of these actions work.
When a node has this issue if you do a 'List Resources' on the node it displays the correct status of all the interfaces. If you 'submit' or 'cancel' out of the List Resources page there is no change in the status of the interfaces that is shown in the Node Details page. This seems to indicate that Solarwinds Orion NPM is able to perform an SNMP poll of the node and receive correct information.
The only way that we have found to date to resolve this issue is to stop and start the NPM poller that is polling this device. We think this is a very extreme action to take.
If anyone else out there has had this issue and has found a way around it, I would love to hear from you. If this is a known bug in NPM I would also like to know if there is a way to work around it. We really do not want to have to resort to restarting a polling engine because the moniotoring for one node no longer appears to be work.
Regards,
... Simon Evans
(Currently running SolarWinds Orion Core 2011.1.0 SP1, APM 3.5, IPSLAMGR 3.5.1, NCM 6.1, NPM 10.1.2 SP1, NTA 3.7, IVIM 1.1.1)