Hi, I have been troubleshooting this for a while and am wondering whether anyone else has run into this issue and was able to resolve it:
We currently have a ticket open for the issue, but none of the troubleshooting steps so far have helped. We have a primary server, an HA server, and 11 polling engines. We are running 2019.2 HF2, NPM 12.5, SAM 6.9.1, NCM 8.0, and UDT 3.4.0 HF1.
The RabbitMQ queues are constantly climbing to well over 40k. I have reset RabbitMQ, purged the messages, and validated that port 5671 is open between the primary and all the pollers. I also get this error when running the Configuration Wizard on the pollers: "Services Configuration Failed, Rabbit MQ on Primary server is not reachable".
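For anyone who wants to repeat the port 5671 check, here is a minimal sketch in Python, assuming Python is available on the pollers. The function names are made up for illustration and the hostname is whatever you pass on the command line. It only tests whether a TCP connection can be opened, so it does not prove that the TLS handshake or RabbitMQ authentication succeeds.

```python
import socket
import sys


def port_is_open(host: str, port: int = 5671, timeout: float = 3.0) -> bool:
    """Return True if a plain TCP connection to host:port succeeds in time."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS lookup failures.
        return False


def check_hosts(hosts, port: int = 5671, timeout: float = 3.0) -> dict:
    """Map each hostname to whether the port was reachable."""
    return {host: port_is_open(host, port, timeout) for host in hosts}


if __name__ == "__main__":
    # Hostnames come from the command line, e.g. the primary server's name
    # when run on a poller. Nothing is checked if no names are given.
    for host, ok in check_hosts(sys.argv[1:]).items():
        print(f"{host}:5671 {'reachable' if ok else 'NOT reachable'}")
```

Since the Configuration Wizard error is raised on the pollers, the more telling direction is probably poller to primary, so I would run it on each poller with the primary's hostname as the argument.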
The last action plan provided to me was below:
Action Plan: Reset RabbitMQ for Orion Platform 2019.2
1- Stop All services (Main, APE and AWS).
2- Stop RabbitMQ service on services.msc (Main poller).
Run the queries in this article
https://support.solarwinds.com/SuccessCenter/s/article/Clear-Information-Service-Subscriptions
3- Navigate to C:\ProgramData\SolarWinds\Orion\RabbitMQ,
Cut the .erlang.cookie and place it in the backup folder on the desktop
4- Navigate to C:\ProgramData\Solarwinds\Orion\RabbitMQ\log.
Cut rabbit@ServerName.log to the backup folder (create a backup folder on the desktop and paste it there)
5- Navigate to C:\ProgramData\Solarwinds\Orion\RabbitMQ\db\rabbit@server name-mnesia\msg_stores\vhosts\628WB79CIFDYO9LJI6DKMI09L\queues.
Cut the folders within the queues folder to the backup folder you have created.
6- Go to C:\ProgramData\Solarwinds\Orion\RabbitMQ\db\rabbit@server name-mnesia\msg_stores\vhosts\628WB79CIFDYO9LJI6DKMI09L.
Cut out the recovery.dets to backup folder.
7- Start RabbitMQ service on services.msc (Main poller).
8- Start All services (Main Poller).
9- Start All services (APE and AWS).
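The file moves in steps 3 through 6 can be sketched as a small Python script, in case it helps anyone repeat them consistently. This is only an illustration under assumptions: the function name and the node_name parameter are hypothetical, the paths and the vhost folder name are the ones quoted in the action plan, and it must only be run while the Orion and RabbitMQ services are stopped.

```python
import shutil
from pathlib import Path

# vhost folder name as quoted in the action plan; it may differ elsewhere.
VHOST_ID = "628WB79CIFDYO9LJI6DKMI09L"


def move_rabbitmq_state(rabbit_root, node_name, backup_dir, vhost_id=VHOST_ID):
    """Move the files from steps 3-6 into backup_dir and return the new paths.

    rabbit_root is e.g. C:\\ProgramData\\SolarWinds\\Orion\\RabbitMQ and
    node_name is the part after "rabbit@" (an assumed naming pattern).
    """
    root = Path(rabbit_root)
    backup = Path(backup_dir)
    vhost = (root / "db" / f"rabbit@{node_name}-mnesia"
             / "msg_stores" / "vhosts" / vhost_id)
    queues = vhost / "queues"

    targets = [
        root / ".erlang.cookie",                   # step 3
        root / "log" / f"rabbit@{node_name}.log",  # step 4
    ]
    if queues.is_dir():                            # step 5: folders only
        targets.extend(sorted(p for p in queues.iterdir() if p.is_dir()))
    targets.append(vhost / "recovery.dets")        # step 6

    moved = []
    for src in targets:
        if not src.exists():
            continue  # skip anything that is already gone
        # Keep the relative layout so the files can be restored later.
        dest = backup / src.relative_to(root)
        dest.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(src), str(dest))
        moved.append(dest)
    return moved
```

The backup keeps the original folder layout rather than dumping everything into one flat folder, so queue folders with similar names cannot overwrite each other and restoring is a straight copy back.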
But this did absolutely nothing. I am at a loss at this point and want to get this issue fixed, so any additional ideas would be greatly appreciated.