2 nodes: same IP, different NodeIDs

I'm not convinced that the answers I got on case # 281834 are accurate so I'm going to ask the community for their thoughts.

I recently discovered a node that was up (green indicator on my map) but had no history on any of the graphs I looked at ... not even the ICMP graphs. I thought it was odd, so I put in a ticket to ask about it, and in the meantime I recreated the node. The new node started collecting historical information right away, so I knew the old node was broken; I just didn't know why.

It turns out that before I came to this company, we upgraded from Orion v9.x to v10.x and a handful of nodes didn't get the new PollingEngineID. A great support tech gave me a query that set every PollingEngineID that wasn't 1 to 1, and suddenly the broken nodes started working.
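For anyone wondering what that kind of fix looks like, here is a minimal sketch run against a stand-in SQLite table rather than the real Orion database. The table layout, the `IPAddress` column name, and the `reassign_to_engine` helper are my own assumptions for illustration; only the `Nodes`, `NodeID`, and `PollingEngineID` names come from this thread, and this is not the query support gave me.

```python
import sqlite3

# Stand-in table for illustration only; the real Orion schema is assumed, not known.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Nodes ("
    "NodeID INTEGER PRIMARY KEY, IPAddress TEXT, PollingEngineID INTEGER)"
)
conn.executemany(
    "INSERT INTO Nodes VALUES (?, ?, ?)",
    [(100, "10.0.50.2", 1), (101, "10.0.50.3", 0), (102, "10.0.50.4", 2)],
)


def reassign_to_engine(conn, engine_id=1):
    """Point every node that is on some other engine at engine_id.

    Rows whose PollingEngineID is NULL are not matched by <> and are left alone.
    Returns the number of rows changed.
    """
    cur = conn.execute(
        "UPDATE Nodes SET PollingEngineID = ? WHERE PollingEngineID <> ?",
        (engine_id, engine_id),
    )
    conn.commit()
    return cur.rowcount


changed = reassign_to_engine(conn)
engines = [row[0] for row in conn.execute("SELECT DISTINCT PollingEngineID FROM Nodes")]
print(changed, engines)
```

On a production database I would back up first and run the matching SELECT before any UPDATE.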

But now I was faced with an unusual situation.  I had two nodes, each with its own NodeID but both were pointing at the same IP Address.  I monitored for a few days and one node would go down briefly but the other would not, so I opened a new case. 

While I waited for a reply I decided to double poll another handful of nodes.  I had nodes at 10.0.50.2 - 10.0.50.4 with nothing at 50.5, so I created a node for 10.0.50.5 ICMP only and once it was created, changed its IP to 10.0.50.2.  I repeated the process till I had 50.2 - 50.4 double monitored.  All eight nodes were collecting data but only the original nodes would trigger alerts.

The first tech I spoke with told me that double monitoring this way was a bad idea. He told me “Since you have the both nodes assigned to monitor the same IP address it is definitely creating a conflict and also it may be causing a mac flapping.” MAC flapping? Really? I've seen port flapping but never MAC flapping. I asked to escalate and the next tech was at least a little better with her technobabble. She insisted that double polling wouldn't work from a single polling engine ... so I set up an eval copy of Orion to monitor my handful of nodes and deleted the duplicates I'd created.

Now when a node goes down, both copies of Orion alert me, but she still tells me that it's a bad idea to double poll.  She said:

You don’t want duplicates from same machine for a couple of reasons.  You are sending data about the same IP address and going to the same database this can cause confusion for customer and database.  The device can also block SNMP due to too many hits.  If the node was to go down in a fast poll this would mean you would also have two fast polls on this device causing bottleneck of traffic and delay’s for other information to be wrote.

I almost buy this but if I run a set of pings to a device from two different DOS windows they don't conflict with each other, so I think she's just blowing a bunch of smoke up my ... tail pipe.

My thought is that Orion sees each node as an individual NodeID.  It doesn't care what that NodeID's IP Address is as long as it has a place to send pings and SNMPGet requests.  So there's no reason that Orion can't monitor the same IP using two different NodeIDs.  Testing seems to indicate otherwise, however, since my duplicate nodes wouldn't alert when they were on the same polling engine but they would when they were monitored by separate copies of Orion.

Can someone give me a believable reason why we can't double poll with a single copy of Orion?  I don't want to have a single NodeID try to ping more frequently than every 60 seconds, but if I had 2 nodes for the same IP, each polling at 60 seconds and if my polling was optimized then in theory I should be able to collect data from one or the other node every 30 seconds.  And triple or quadruple polling would give me even more reliable data on when a node goes down/up.
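To make the arithmetic concrete, here is a small sketch of the schedule I have in mind. It only models timestamps; it does not poll anything, and it assumes the scheduler would actually stagger the duplicate nodes, which is exactly the part I can't confirm for Orion. The function names are mine.

```python
def merged_schedule(interval, offsets, horizon):
    """Return the sorted poll times (in seconds) for several nodes that share
    one polling interval but start at the given offsets."""
    times = []
    for offset in offsets:
        times.extend(range(offset, horizon, interval))
    return sorted(times)


def gaps(times):
    """Seconds between consecutive polls in a merged schedule."""
    return [later - earlier for earlier, later in zip(times, times[1:])]


# Two NodeIDs on the same IP, each polling every 60 s, staggered by 30 s.
staggered = merged_schedule(60, [0, 30], 240)

# Same two NodeIDs, but the scheduler happens to fire them together.
aligned = merged_schedule(60, [0, 0], 240)

print(max(gaps(staggered)))  # longest wait between samples when staggered
print(max(gaps(aligned)))    # longest wait between samples when aligned
```

With a 30-second stagger the longest gap between samples is 30 seconds; if both nodes fire at the same moment the gap stays at 60 seconds and the second node adds nothing. So the whole idea depends on how the polling engine spaces the two jobs.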

What are -your- thoughts?

  • Garrison,

    What is your goal here? Do you want these nodes to show up twice or are you just trying to understand what happened? You should be able to change the IP address of a node without any issues related to polling or alerting.

    Orion doesn't support double monitoring a node with the same IP and different node IDs. It's not something that can be accomplished "naturally" in the UI. You could monitor the same node with different IP addresses, but because of the upgrade and the script used to get things working, you're in a slightly broken state with the same node IP but different Node IDs. What's wrong with cleaning up these duplicates and just polling the node once? If you want higher polling granularity then you should just adjust the polling frequency (although I can understand if you are trying to get higher granularity on a small set of specific nodes as opposed to changing this for all nodes).

    You are right: if you installed an eval and monitored the same node from two different servers, both servers should report basically the same statistics and there shouldn't be any gaps. The only time this is an issue is when a device gets overloaded with SNMP traffic and drops it. I doubt you're running into this issue unless you've cranked up your polling frequency or have several other management systems hitting the device.

    Mav

  • Hey Mav,

    My goal is to determine why my peers (at work) think that my nodes for New York, Milwaukee, Canada, and the Internet (8.8.4.4) are all going down at the same time.  I think I finally narrowed the issue down to an edge device that is rebooting so that some or all of these devices go down randomly but come back up within 2-4 minutes.

    My concern was that sometimes it was only one of these nodes that would go down, sometimes it was any two of the four that would go down, and sometimes it was all four.  I've got other external nodes but they didn't seem to be affected.

    My goal here was to see if Orion was having problems polling nodes that are not in the same city as the Orion server.  I know that Orion can do this ... I've seen an Orion server in New York poll London and New Delhi at the same time ... but it didn't seem to be working for me.

    The tech support rep I was working with told me that Orion can't poll two different nodes when they have the same IP address.  That doesn't make sense to me because the column in the Nodes table that is the unique index is NodeID, not IP Address.  So if NodeID 100 is 2.2.2.2 and NodeID 101 is also 2.2.2.2, Orion shouldn't have a problem with that ... as long as the unique identifier (the NodeID) is not the same for both nodes.  But if the IP Address is the unique index then I shouldn't be able to get a SQL query to update the table in such a way that it accepts the same IP for two different NodeIDs.  Either way, it seems that something is not working as designed.

    Of course, the most logical explanation is that there's something else going on that I don't understand.  I accept that and am trying to find the flaw in my logic so that I don't accidentally cause myself more problems than I fix by working with my nodes.  And that is the reason for this thread.

    I hope that answers your question.

    Garrison
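The NodeID versus IP Address argument in the last reply can be checked at the database level with a toy table. This is a sketch against in-memory SQLite, not the Orion database: the two-column layout, the `IPAddress` column name, and the helper functions are assumptions made up for the example. It shows only what a table will accept under each kind of index, and says nothing about how the polling engine treats such rows afterwards. The last helper is the sort of query that could list duplicate IPs before the cleanup Mav suggests.

```python
import sqlite3


def make_nodes_table(unique_ip):
    """Create a toy Nodes table keyed on NodeID, optionally with a UNIQUE IP column."""
    conn = sqlite3.connect(":memory:")
    ip_column = "IPAddress TEXT UNIQUE" if unique_ip else "IPAddress TEXT"
    conn.execute(f"CREATE TABLE Nodes (NodeID INTEGER PRIMARY KEY, {ip_column})")
    return conn


def try_insert(conn, node_id, ip):
    """Insert a node; return False if a uniqueness constraint rejects it."""
    try:
        conn.execute("INSERT INTO Nodes VALUES (?, ?)", (node_id, ip))
        return True
    except sqlite3.IntegrityError:
        return False


def duplicate_ips(conn):
    """List (IPAddress, count) for every IP that appears on more than one NodeID."""
    rows = conn.execute(
        "SELECT IPAddress, COUNT(*) FROM Nodes "
        "GROUP BY IPAddress HAVING COUNT(*) > 1 ORDER BY IPAddress"
    )
    return rows.fetchall()


# Case 1: only NodeID is unique, so two NodeIDs may share 2.2.2.2.
keyed_on_node_id = make_nodes_table(unique_ip=False)
first_ok = try_insert(keyed_on_node_id, 100, "2.2.2.2")
second_ok = try_insert(keyed_on_node_id, 101, "2.2.2.2")

# Case 2: the IP column is also unique, so the second row is refused.
keyed_on_ip = make_nodes_table(unique_ip=True)
try_insert(keyed_on_ip, 100, "2.2.2.2")
rejected = not try_insert(keyed_on_ip, 101, "2.2.2.2")

print(first_ok, second_ok, rejected, duplicate_ips(keyed_on_node_id))
```

If the real table behaves like case 1, the database will happily hold two NodeIDs for one IP, which matches what the thread describes; any restriction would then have to come from the application rather than from the index.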