Am I running into a string of bad luck or is the Orion platform becoming more and more unstable?
- For the last year or so I randomly get agents that the polling engine can't connect to. I have to uninstall the agent and delete the node in Orion and set it up new.
- This last upgrade to 2020.2.6 HF1 on the MPE went fine. When I try to do the installation on the APE it says all products are up-to-date. If I look on the Web Interface in the Deployment Health tab it says modules are out of date on the polling engines. When I go to Updates & Evaluations it says all three of my polling engines are up-to-date. (Support wants to uninstall and re-install the APEs. I'm not sure how I feel about that solution.)
- We get alerts that a node is down and then one minute later a recovery alert. The node was never down.
- The dashboards don't update with the current status of nodes.
- The node status says up but the CPU and Memory data hasn't been collected for weeks.
- The List Resources feature never returns results when done from the MPE, and is the typically slow from the APEs.
Man I'm getting frustrated with these problems piling up.