We've hit a strange condition twice in the last two weeks: the system time on our main polling engine jumps to a seemingly random value and then changes back a few minutes later. In the latest occurrence, on 4/20 at 10:06:21, the clock jumped to 8/18/2024 at 1:21:34am. This set off a chain reaction: SolarWinds components failed, reports and alerts ran and got out of sync, and database maintenance kicked off and purged large amounts of detail data that it thought it needed to remove. DB maintenance has been failing ever since the first time jump. The time changed back approximately 5 minutes after the initial jump.
For the DB maintenance failures I do have a case open with support which is, to be generous, spinning its wheels.
The offending PID clearly belonged to the Windows Time service. All our servers are set to sync to one of two DCs, and this issue has only happened on our main polling engine. The time didn't change on either DC, and no other servers had their times adjusted randomly: not our additional poller, nor anything else. Our OS engineering team has investigated and has a case open with Microsoft, but they can't find a definitive root cause. They suspect it may be STS (Secure Time Seeding) and are recommending we disable that feature.
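In case it helps anyone trying to catch this on their own servers, here is a minimal Python sketch of one way to spot a jump like ours. It is not something from our environment or from SolarWinds. It compares how far the wall clock moved against how far a monotonic clock moved between samples, since the monotonic clock is not affected when the system time is reset. The names `find_clock_jumps` and `sample_clock` and the 5-second tolerance are made up for illustration.

```python
import time
from datetime import datetime, timedelta


def sample_clock():
    """Take one (wall clock, monotonic seconds) sample."""
    return datetime.now(), time.monotonic()


def find_clock_jumps(samples, tolerance=timedelta(seconds=5)):
    """Return (previous wall time, new wall time, drift) for each jump.

    samples is a list of (datetime, monotonic_seconds) pairs in the order
    they were taken. The tolerance is an assumed value; tune it to taste.
    """
    jumps = []
    for (prev_wall, prev_mono), (wall, mono) in zip(samples, samples[1:]):
        # How much time really passed according to the monotonic clock.
        expected = timedelta(seconds=mono - prev_mono)
        # How much the wall clock moved beyond that.
        drift = (wall - prev_wall) - expected
        if abs(drift) > tolerance:
            jumps.append((prev_wall, wall, drift))
    return jumps
```

Run in a loop that calls `sample_clock()` every few seconds and logs whatever `find_clock_jumps` returns, this would flag both the forward jump and the jump back. That gives exact timestamps to line up against the Windows Time event logs.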
We upgraded to SolarWinds Platform 2024.1 on 3/20 and this hadn't happened prior to 4/12, so it isn't clear whether SolarWinds is involved, but as noted it hasn't happened on any other servers. Has anyone seen this happen with SolarWinds, or just in general? Any thoughts would be appreciated.
www.kaspersky.co.in/.../