I am trying to make my availability metric more accurate for situations where everything appears to be working, but performance is so poor that the system is effectively down. For example, a DDoS attack may take a system down as far as public access is concerned, while from inside the network everything appears fine. Traffic may be queued to the point that nothing appears to be getting through. There is no apparent outage, yet the system is effectively down. There are performance counters and other statistics we can poll that indicate a "Critical" condition, but our reported availability is still 100%.
How can I use the critical performance state to reflect the actual availability (or rather, unavailability) of the system?
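
To make the question concrete, here is a minimal sketch of the kind of calculation I have in mind, assuming availability is computed from equally spaced polls. The `Poll` record, its field names, and the `availability` function are hypothetical and not taken from any particular monitoring tool; the idea is simply that a poll in the "Critical" state could be counted as downtime even though the up/down check passed.

```python
from dataclasses import dataclass


@dataclass
class Poll:
    reachable: bool  # the ordinary up/down check passed (hypothetical field)
    critical: bool   # a polled counter crossed its "Critical" threshold (hypothetical field)


def availability(polls, count_critical_as_down=True):
    """Return availability as a percentage of equally spaced polls."""
    if not polls:
        raise ValueError("at least one poll is required")
    good = sum(
        1
        for p in polls
        if p.reachable and not (count_critical_as_down and p.critical)
    )
    return 100.0 * good / len(polls)


# Ten polls: the node always answers, but two polls were in the Critical state.
polls = [Poll(True, False)] * 8 + [Poll(True, True)] * 2

print(availability(polls, count_critical_as_down=False))  # 100.0
print(availability(polls))                                # 80.0
```

With the flag off this reproduces the 100% figure we report today; with it on, the two Critical polls are treated as unavailable time. Whether a Critical poll should count as fully down or only partially is exactly the part I am unsure about.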