This is a touchy subject for a lot of people, if hanging around some of the other thwackians is any indication.
I'm always between being strict about alerting practices
- If you want an alert, it needs a run-book, documentation, testing, and someone on the hook for it that's acknowledging it and fixing or closely tracking and monitoring the issue.
Or a more loose utilization
- You just want an email when X has a state change, you may or may not acknowledge it (this might be for testing or trying to catch something malfunctioning)
Or probably the worst use
- Just make this thing email these people, take these actions because it can.
Now, in living some of this, I find lots of admins, and engineers, bosses, and users conflate the golden "Actionable" alert and the warm and fuzzy "notification" that holds free the recipient from any responsibility.
This seems like a concept that, to me; could use some retooling in Orion.
Adding some natively supported divisions between an "alert" where its a full-fledged supported thing with expectations attached that someone is using it to fix a problem and a "notification" where one or many users will receive some information about an alertable-element with no-strings-attached.
Specifically I added "natively" because I believe this should be built in. Sure, you could create your own methodology to this or slice your existing alerts apart with some Custom SQL or SWQL and try to leverage the CC or BCC or maybe use Orion as just another source of data to relay to some type of information management system where your "real" logic is - but Orion is so close to that now - at least for network monitoring related intelligece. Some might consider event management core to this subject.
I'd love to see Orion gain the ability to offer Push Notifications that might run alongside Alerts, but would be a bit of a different animal, less tracking, less alert actions more single-serving data points. Keep the automation with "Alerts."
And (naturally) taking that one step further, what if the Orion platform would have the ability to have its own natively integrated status page for each of these paradigms? One with a focus on the platform, showing operational statuses of modules, reported incidents and the ability for anyone to start an email, text, or RSS subscription for those status changes. The other with a focus on all the elements your system is monitoring and being able to configure which appear and can be subscribed to in the same fashion.



Anywho - I'm curious as to what your collective hive-minds' out there thoughts and ideas about these topics.
- Is the existing implementation of "alerts" good enough or should there be more?
- How do you separate out "actionable" from "notify" for real-time information?
- For those that need to work on an issue and those that just want to be in the loop?
- How do you leverage aggregate reporting to help solve these challenges?
- Would you want "Push Notifications" from Orion on your desktop or mobile device if it were possible without another product?
Special thanks to the MVPs that helped refine these thoughts for me! jbiggley zackm silverbacksays