Agents are reporting as down on 3 of my APEs; they're working fine on the other APEs.
The Business Layer log shows these errors:
2020-06-11 09:55:17,443 [Scheduler] ERROR SolarWinds.BusinessLayerHost.PluginManager - Child process for
\Program Files (x86)\SolarWinds\Orion\AgentManagement\SolarWinds.AgentManagement.ServiceCore.dll not found and will be restarted.
2020-06-11 09:55:17,443 [Scheduler] INFO SolarWinds.BusinessLayerHost.PluginInstanceSeparateProcess - Initialize plugin
\Program Files (x86)\SolarWinds\Orion\AgentManagement\SolarWinds.AgentManagement.ServiceCore.dll.config
Most Agent Management logs are blank, with the exception of the Agent Management service log and the Agent Management Watchdog log. The service log doesn't show any errors; the watchdog log shows these errors:
] WARN SolarWinds.AgentManagement.Messaging.Contract.MessagingServiceProxy - Connection to messaging service at 'net.pipe://localhost/SolarWinds/AgentManagement/Messaging' faulted. Running reconnection.
2020-06-10 09:10:08,367 [11] ERROR SolarWinds.AgentManagement.Messaging.Contract.MessagingServiceProxy - Messaging Service endpoint 'net.pipe://localhost/SolarWinds/AgentManagement/Messaging' was not found. There was no endpoint listening at net.pipe://localhost/SolarWinds/AgentManagement/Messaging that could accept the message. This is often caused by an incorrect address or SOAP action. See InnerException, if present, for more details.
2020-06-10 09:10:08,368 [9] ERROR SolarWinds.AgentManagement.Messaging.Contract.MessagingServiceProxy - Error closing Messaging Service channel factory.
System.ServiceModel.CommunicationObjectFaultedException: The communication object, System.ServiceModel.Channels.ServiceChannel, cannot be used for communication because it is in the Faulted state.
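In case it helps anyone compare logs between a working APE and a broken one, here is a minimal Python sketch that counts WARN/ERROR entries per logger. It assumes every entry starts with the `timestamp [thread] LEVEL logger - message` layout seen in the excerpts above; the function name `summarize` is my own and is not part of any SolarWinds tooling. Lines that don't match, such as stack traces or truncated entries, are skipped.

```python
import re
from collections import Counter

# Assumed layout: "2020-06-10 09:10:08,367 [11] ERROR Some.Logger - message"
LOG_LINE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) "
    r"\[(?P<thread>[^\]]+)\] "
    r"(?P<level>[A-Z]+)\s+"
    r"(?P<logger>\S+) - (?P<msg>.*)$"
)


def summarize(lines, levels=("WARN", "ERROR")):
    """Count log entries per (level, logger) for the given levels."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line.strip())
        # Skip continuation lines (stack traces) and other non-matching text.
        if match and match.group("level") in levels:
            counts[(match.group("level"), match.group("logger"))] += 1
    return counts
```

To use it, pass the lines of a log file (for example `summarize(open(path, encoding="utf-8", errors="replace"))`, where `path` is wherever the watchdog log lives on that APE) and compare the counts between servers.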
Trying to install an agent manually on an end client and pointing it to any of these 3 APEs gives the error:
Https connection to ((APE server name)) on port 1778 succeeded but the agent management service did not respond
Troubleshooting steps I've tried:
- Confirmed all plugins are there, confirmed the SolarWinds.AgentManagement.ServiceCore.dll file is there.
- Confirmed I have all the C++ redistributable packages
- Repaired the core installer twice, repaired the job engine and collector services, and re-ran the Configuration Wizard (three times now)
- Followed every KA/THWACK post I could find on the matter:
https://support.solarwinds.com/SuccessCenter/s/article/Forcing-the-Business-Layer-to-load-plugins-in-a-separate-or-64-bit-process-in-NPM-12
https://support.solarwinds.com/SuccessCenter/s/article/The-NPM-Summary-homepage-intermittently-loses-connection
https://support.solarwinds.com/SuccessCenter/s/article/DPI-and-AgentManagement-BusinessLayer-issues
https://thwack.solarwinds.com/t5/NPM-Discussions/Polling-Engine-Down-Database-Sync-not-occurring/m-p/352147
https://thwack.solarwinds.com/t5/NPM-Discussions/SolarWinds-BusinessLayerHost-exe-terminated-due-to-an-unhandled/m-p/180573
- Confirmed AV exclusions are in place
I have had a ticket open with SolarWinds and I feel like I'm running around in circles with support. Does anyone have any ideas?