I have 200+ alerts copied from a master alert that only differ in node name. Most of these alerts trigger as they should and function properly. From time to time an alert is found that will not trigger alert actions but will send reset actions when the node has recovered. I have tried manually dropping the nodes to reproduce the event and it happens consistently.
I opened a ticket and we discovered if you copy the alert (changing nothing) the alert will work correctly as the other alerts do. This worked fine for three alerts and now I have discovered one that refuses to send trigger actions. I have tried modifying my trigger actions in the alert and changing alert delays, etc. but still it refuses to execute the trigger actions.
Does anyone have any insight into why this may be happening. My SQL is pretty weak but I'm willing to try what I have to and get all of my alerts working correctly.
Running 9.5 SP3
428 Nodes