Good afternoon, fellow thwackers.
I'm seeing some very odd behaviour on some of our servers: every so often a node stops responding to SNMP, and I can't find any reason for it. The SNMP service itself doesn't stop. Restarting the service doesn't bring the responses back, and neither does restarting the server.
I've asked the network team to check whether anything is stopping the responses from getting through. Ping from server to server works fine. There's no pattern to when it stops and starts, and no real pattern to which servers are affected. A node just stops responding at random and then starts again some time later.
It isn't limited to a particular VLAN, cluster or domain. The affected nodes are spread evenly across our infrastructure.
The network team has come back to me stating that no SNMP traffic was blocked.
The only other thing I could try was restarting the SolarWinds box.
Any other ideas?