I am working on a form of automatic fail over in the event a polling engine fails.
Seems like an alert could be set up with an action:
Alert:
node selection: (all nodes assigned to engine 5)
Alert Trigger: (engine 5 services, interface, (whatever you want) not working
alert action: execute (script action, sql update, best solution) to move effected nodes to engine 11, only have to change one value in the DB.
Here is what I have now: alert suppression using dependencies:
Custom property_poller: value = poller1, poller2, poller3, etc....
Group with dynamic query: if CP_poller = poller 1, assign to group 1, (etc..)
Depenency: all members of group 1 are dependant on poller1 interface being operational.
This solves the problem of sending off 1000 alerts, when someone accidentally disables the interface, but I think the ability to assign members to a group based on the value of their engineID would be more direct and more efficient.
Just need a way to use engineID in a dynamic query, and use engineid as the node selection criteria in an alert that can execute the action to change the engine id on the nodes.
For example:
Group1 = all nodes assigned to poller 1.
group1 is dependant on poller1 being operational
alert:
all nodes on poller1.
if poller1 is down, execute action to move nodes to poller07.
this way once the nodes are moved they are also moved to group 7 and alerting will automatically resume, but will not alert while poller1 is down and they are assigned to poller1.
So the question is, is it possible to assign nodes to a group based on actual assigned polling engine?
Can an alert object selection criteria be selected based on assigned polling engine?
Can a Sql update or something similar be created as an action?