quote: Originally posted by kanned
I recently upgraded from 7.1.15 to 7.2. After this upgrade I continued to get a large number of false positives from SolarWinds. I changed the ICMP timeout to 5000 and the byte size to 0, hoping this would solve my problem. It did not. I then changed the polling rate to 5 minutes. This still resulted in a large number of false node down events.

Is anyone else experiencing these issues?

I have since rolled back to 7.1 and everything seems fine as before. I guess I'll wait a little longer next time before upgrading.

Joshua E. Morehouse
Senior Network Engineer
Friedman, Billings, Ramsey Group, Inc.
"Capital For Your Conquest"
jmorehouse@fbr.com
quote: Originally posted by keist
What all of you should do is verify the packet loss with a sniffer and the Orion debug log; tech support can help you turn on the debug. I had this problem about 8 months ago, and it turned out that the code on our Cisco 6509 had a bug in it. Since Orion sends only one ping packet initially, if that packet gets lost you see 10% packet loss. Even to this day I have some devices that report 10% packet loss, and then it goes away.

What we discovered was that we use a program called Altiris for pushing patch updates to workstations, and the manager of that system was trying to use Wake on LAN. This Wake on LAN feature had a bug that would flood the network with broadcasts from multiple machines simultaneously. A broadcast is serviced by the route processor and not fast switched, and our 6509s have a max rating of something like 20,000 packets/sec on the supervisor engine. When these machines broadcast at the same time, they created 50,000+ packets/sec, causing input queue drops on the VLANs and thus a missed ping. This is not limited to Altiris; other misbehaving applications could cause the same issue.

Every time we have looked into the 10% packet loss, we verified that Orion was in fact sending the first ping to the device, but the response never got back to the Orion server.

Hope this helps. I have spent many hours troubleshooting my issues.

Dan
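To put rough numbers on Dan's description, here is a minimal Python sketch. It assumes the loss figure is computed over a window of 10 polls (a guess that would explain why one missed reply shows up as 10%; this is not confirmed Orion behavior) and uses a deliberately simple overload model for the route processor. The function names `loss_percent` and `drop_fraction` are made up for illustration.

```python
def loss_percent(replies):
    """Percentage of polls in the window that got no reply."""
    lost = sum(1 for ok in replies if not ok)
    return 100.0 * lost / len(replies)


def drop_fraction(offered_pps, capacity_pps):
    """Fraction of packets dropped when offered load exceeds capacity.

    Simplistic model: everything above capacity is dropped evenly.
    """
    if offered_pps <= capacity_pps:
        return 0.0
    return (offered_pps - capacity_pps) / offered_pps


# One missed reply in an assumed window of 10 polls.
window = [True] * 9 + [False]
print(loss_percent(window))           # 10.0

# 50,000 broadcast packets/sec offered against a 20,000 packets/sec rating.
print(drop_fraction(50_000, 20_000))  # 0.6
```

Under these assumptions, a single lost ping is enough to produce the 10% reading, and during the broadcast flood more than half of the packets punted to the route processor would be dropped, so a missed ping reply is not surprising.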
quote: Originally posted by josh@solarwinds.net
There is a beta release of Orion NPM SLX version 7.3 at:
ftp.solarwinds.net/.../SolarWinds-NetPerfMon-V7-SLX.exe

Version 7.3 includes changes that should minimize the false positives that you are seeing.

Please send any feedback on this beta release to support@solarwinds.net
quote: Originally posted by Network_Guru
pancamo,

The LSASS problem is usually caused by quitting your RDP session to the server without first closing NetPerfMon Manager (SysManager). Restarting the polling service will bring LSASS back under control. Apparently it is a Microsoft bug.

Is this the same issue you are having, or is this something different?

-=Cheers=- NG