Hello everyone!
I need your help with an issue I have been facing for the last few months. I also have a support ticket open (02048013), but we are not getting anywhere close to a resolution.
Below is a brief summary of the problem -
Issue - High memory (99%) utilization on SolarWinds polling engine.
Description - I have 3 SolarWinds servers in my environment: 1x main poller/server and 2x additional polling engines. The high memory problem is on one of the additional polling engines.
The hardware specs for the 3 servers are pretty much the same: the main poller is a Win 2016 server with 32 GB of memory, and the 2x additional polling engines are Win 2022 servers with the same 32 GB of memory. All 3 servers have the same 4-processor CPU.
I only have IPAM and NCM licenses with SolarWinds, and these are the only two subscriptions running. I am running version 2025.2.1.
Every few days the memory on the problem polling engine hits 99%, which is caused by "Solarwinds.Orion.LogMgmt.SyslogService", and it stays that way until I restart the service.
The number of devices sending syslog is 715 to the main poller, 51 to the non-problem polling engine and 126 to the problem polling engine. It is the syslog service causing the high memory issue: as soon as I restart the syslog service, the memory usage on the server drops back down.
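To show whether the memory climbs steadily over the days or jumps all at once, something like the rough sketch below could be scheduled on the problem polling engine to log the service's memory. It only uses the built-in Windows `tasklist` command. The image name in the comment is my assumption (the service name plus `.exe`), so check the real process name in Task Manager first. The "Mem Usage" column that `tasklist` reports is the working set, and its number format depends on the server's locale, so the parser simply keeps the digits.

```python
import csv
import subprocess


def parse_mem_kb(tasklist_csv_line):
    """Parse one CSV line of `tasklist /FO CSV /NH` output.

    Returns (image_name, pid, mem_usage_kb). The memory field looks like
    "12,345 K"; every non-digit character is dropped before conversion.
    """
    fields = next(csv.reader([tasklist_csv_line]))
    digits = "".join(ch for ch in fields[4] if ch.isdigit())
    return fields[0], int(fields[1]), int(digits)


def sample(image_name):
    """Run tasklist (Windows only) and return one tuple per matching process."""
    out = subprocess.run(
        ["tasklist", "/FO", "CSV", "/NH", "/FI", f"IMAGENAME eq {image_name}"],
        capture_output=True, text=True, check=True,
    ).stdout
    # Data rows start with a quote; the "INFO: No tasks..." message does not.
    return [parse_mem_kb(line) for line in out.splitlines() if line.startswith('"')]


# Example call on the polling engine (image name is an assumption):
# sample("SolarWinds.Orion.LogMgmt.SyslogService.exe")
```

Logging the result every few minutes with a timestamp would show whether the growth is gradual (which would look like a leak or a backlog building up) or sudden.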
Troubleshooting performed -
- Re-installed and upgraded the SolarWinds application on the problem polling engine
- Ran a packet capture for UDP port 514 and noticed a couple of firewalls as top talkers; however, all the firewalls in my environment have the same syslog level (informational) configured
- Checked the interface utilization for these firewalls on our monitoring tool (LogicMonitor) to see if they really are sending a lot of syslog traffic, and no, the throughput on these firewalls is less than 500mb and I did not see any high spikes at any particular time
- Also checked the Ethernet values in Task Manager when the high memory issue occurs and do not see the server getting a lot of input traffic (picture attached)
- If the suspected firewalls were indeed causing the issue, then the memory would go down at some point when the syslog traffic goes down? But it just stays stuck at 99%
- Made changes to the database (tempdb transaction log file size and query optimizer hotfixes), but all 3 servers use the same database, so I don't think it's a database issue here. If it was the DB, it would affect all 3 servers?
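Since interface throughput does not say much about how many syslog messages per second a device sends, here is a rough sketch of how the top talkers from the capture could be counted per source. It assumes the capture was exported from Wireshark as CSV with the default "Time", "Source" and "Length" columns, and that "Time" is seconds since the start of the capture. The sample rows are made up purely for illustration.

```python
import csv
import io
from collections import Counter


def top_talkers(csv_text, limit=5):
    """Return (source, packets, bytes, packets_per_second) per source,
    sorted by packet count, highest first."""
    packets, size = Counter(), Counter()
    first = last = None
    for row in csv.DictReader(io.StringIO(csv_text)):
        t = float(row["Time"])
        first = t if first is None else min(first, t)
        last = t if last is None else max(last, t)
        packets[row["Source"]] += 1
        size[row["Source"]] += int(row["Length"])
    # Capture duration in seconds; 0.0 if there are fewer than two timestamps.
    duration = (last - first) if first is not None else 0.0
    result = []
    for src, count in packets.most_common(limit):
        rate = count / duration if duration > 0 else 0.0
        result.append((src, count, size[src], rate))
    return result


# Made-up sample in the assumed Wireshark CSV layout.
SAMPLE = '''"No.","Time","Source","Destination","Protocol","Length","Info"
"1","0.000000","10.0.0.1","10.0.0.50","Syslog","200","LOCAL4.INFO: sample"
"2","0.500000","10.0.0.1","10.0.0.50","Syslog","200","LOCAL4.INFO: sample"
"3","1.000000","10.0.0.2","10.0.0.50","Syslog","150","LOCAL4.INFO: sample"
"4","2.000000","10.0.0.1","10.0.0.50","Syslog","200","LOCAL4.INFO: sample"
'''

for source, count, total_bytes, rate in top_talkers(SAMPLE):
    print(f"{source}: {count} packets, {total_bytes} bytes, {rate:.2f} msg/s")
```

Comparing the messages per second per device between the problem polling engine and the healthy one might show whether the load is really different, even when the bandwidth looks low.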
I did see a couple of similar posts on here, but none of them had a resolution.