Hi All,
I'm starting to roll out a large NPM and APM infrastructure to monitor many hundreds of servers.
One issue i'm worried about is the size of 'configure advanced alerts' popup will get to.
For example, i'm setting up default alerts for critical disk space (95% +), server down, server rebooted etc which will be mailed to an operations team to resolve the issues. With this we'll have at least 6 separate alerts setup.
Other people within the business will also want to be mailed about servers they are interested in. For example the Oracle team will want alerts from Oracle servers and also these servers might have different monitoring requirements than the default alerts.
So if we start overriding our default alerts and have to create lots of new alerts to send emails to different people in the business we'll get hundreds of alerts in the list which will become extremely unmanageable.
Is there a suggested way to make this more manageable? Is there a way to email different groups of people (if for example a server is an oracle server) without having to create a whole new set of alerts?
Also if one group of servers needs a different threshold applied, we'll need to create a new group of alerts. Is there a simple way to override these without creating a whole new set of alerts?
Thanks,
Chris.