strange "up" messages and workaround ?

Question

hey guys,

i have a strange problem here : i'm monitoring more than 50 CISCO 3548XL's here at the office. from time to time (preferably at night) we get a pager alarm stating that one of the switches (which one changes randomly) is "up" again. but it never went down. 
ok, seems like the switch went to "unknown" state and "up" before it Orion considered it "down". since my alerts says "notify on status change", the pager alert seems correct. here comes the "but"... the switch never went to "unknown" state. 
what i want to do now : Orion should report the switch state "up" only if the previous state was "down". alert suppression, got that in mind. 
do i have to setup one alert for each device ? 
i can't say "alert me when status of switch %1 is up if status of switch %1 has been down"...

hope someone understands my crude ideas here :-/

mihi est propositum in taberna mori

Seraphym · Answer

maybe I found the solution. 
the database maintenance jobs runs every night at 02:00 pm. the messages were always sent between 02:00 and 04:00. maybe our machine is simply not powerful enough. I thought 3,0 GHz and 4 GB ram should be enough...

mihi est propositum in taberna mori

Seraphym · Answer

thanks for your feedback on this topic.

in fact some switches change to warning and then to up status really quick. so the node never goes down. it looks like just one polling can't reach the node. node goes to warning, next polling reaches target, node goes up. 
i now changed the alert to notify only when a node goes "really" down. the up-notification is disabled. 
@bleearg13 : had spanning tree in mind too. but all the servers connected to the switches are up and running without one second of downtime or non-availability.

mihi est propositum in taberna mori

bleearg13 · Answer

Is it at all possible that you have an underlying network issue occurring?  I've seen really bizarre things happen when spanning tree is not configured properly in a network that needs it.  Unplugging a server on one switch would cause a completely different switch or node to "go down".