I can't imagine we're unique in this problem but I'm trying to figure out the best way to solve this and I'm hoping someone may already have come up with a solution.
We have a lot of servers in Solarwinds SAM, around 1000+ and adding more. Many of these servers belong to different groups of people based on the applications they're running. The problem is letting the right people know when something happens. Sounds simple so far. We're using custom fields right now to help manage and categorize which works really great. We base a lot of alerts off of using the custom fields.
Here's where our needs get complicated and what I'm trying to solve without having to write a TON of similar alerts, This is kind of hard to describe to get the scope of this.
Dev needs an alert when something happens to their servers. Unless it's a SQL server, in which case that goes to a different team, but Dev still needs to know. Dev isn't production so we don't want the guys that handle production SQL servers to be notified unless the alert is a cluster fail alert, then they need to get involved. It gets more complicated when we get to the different Application servers that involve multiple groups.
So, right now everyone is complaining that they're getting too many alerts, including my group!
The idea I have is to take the alert and dump it to an external script, and let the script decide whom gets the alert. Is it a Database cluster down? Send and SMS page to the DBA group's phones unless it was a non-prod server. Low on drive space? If it's critical, send it to ops, otherwise send it to the app team that manages it. Do we have more than 10 VPN connections down at once, that's a problem, otherwise don't bother. Etc. Etc.
I'm trying to describe a message "router" of sorts that could take one alert that handles all the components of a template, and send messages to the appropriate targets depending on what is actually wrong. That way people only see actionable e-mails specific to them.
We have about 275+ alerts now. It would probably take triple that to try and carve them all up. There's gotta be an easier way.