Is there a way in NCM to make a change on multiple nodes, run pre- and post-checks (interface status, node up, etc.), and create a report of failed nodes?
One way to do it would be to pair the scheduled NCM change with a time-restricted alert.
For instance, if you decide to make these NCM changes between 01:00 and 02:00, you could set up a new alert that looks for nodes or interfaces going down during that window ONLY, and then configure the trigger actions on that alert so the relevant follow-up (notification, report, etc.) takes place.
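The time restriction boils down to a simple window test. As a rough illustration of the logic only (this is not how the alert is configured in the product, and `in_change_window` is a hypothetical helper name), something like this would decide whether a down event falls inside the change window:

```python
from datetime import datetime, time


def in_change_window(event_time: datetime,
                     start: time = time(1, 0),
                     end: time = time(2, 0)) -> bool:
    """Return True if event_time falls inside the window [start, end)."""
    t = event_time.time()
    if start <= end:
        return start <= t < end
    # The window wraps past midnight, e.g. 23:00 to 01:00.
    return t >= start or t < end
```

Only events for which this returns True would be attributed to the change job; anything outside the window would be left to the normal alerts.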
That is an idea to consider. I was looking for a solution that is part of a script and/or config change template. I haven't found a way to stop a job on failure either. I need to stop the job if a node fails, so that I'm not sending a field tech to multiple outages in one day caused by the same job/config change.
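For what it's worth, the stop-on-failure behaviour described here can be sketched as a small driver script that works through the nodes one at a time. This is only a sketch under assumptions: `rollout`, `apply_change`, and `check_node` are hypothetical names, not NCM features or API calls, and you would have to supply your own functions that push the change and test node/interface status.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class RolloutReport:
    succeeded: List[str] = field(default_factory=list)    # changed and healthy afterwards
    pre_failed: List[str] = field(default_factory=list)   # unhealthy before the change, left untouched
    post_failed: List[str] = field(default_factory=list)  # unhealthy after the change
    skipped: List[str] = field(default_factory=list)      # never attempted because the job halted


def rollout(nodes: Iterable[str],
            apply_change: Callable[[str], None],
            check_node: Callable[[str], bool]) -> RolloutReport:
    """Apply a change node by node and halt on the first post-check failure.

    apply_change and check_node are hypothetical callables supplied by the
    caller; check_node should return True when the node looks healthy.
    """
    report = RolloutReport()
    pending = list(nodes)
    while pending:
        node = pending.pop(0)
        # Pre-check: do not touch a node that is already unhealthy.
        if not check_node(node):
            report.pre_failed.append(node)
            continue
        apply_change(node)
        # Post-check: a failure here was likely caused by the change, so stop.
        if not check_node(node):
            report.post_failed.append(node)
            break
        report.succeeded.append(node)
    # Whatever is left was never attempted.
    report.skipped.extend(pending)
    return report
```

The report then doubles as the list of failed nodes: `post_failed` holds at most one node (the one that stopped the job), and `skipped` shows what still needs the change once the problem is understood. Exceptions raised by `apply_change` are not handled in this sketch.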