We have two clusters:
NS1 (hardware is InfoBlox 550 HA)
which is comprised of nodes ma1 and ma2
and NS2 (also InfoBlox 550 HA)
which is comprised of nodes cf1 and cf2
The nodes on NS1 constantly report 100% CPU usage, but when the admin logs into the machine he can see that they are not truly using that much CPU. The nodes on NS2 report correctly.
I have built a custom poller using a formula that we obtained from InfoBlox support:
cpu_usage = 100 * (ssCpuRawUser.0 + ssCpuRawNice.0 + ssCpuRawSystem.0) / (ssCpuRawUser.0 + ssCpuRawNice.0 + ssCpuRawSystem.0 + ssCpuRawIdle.0 + ssCpuRawWait.0)
that I have assigned and added as a graph and a gauge to the node details page for ma1, ma2, cf1 and cf2. Our admin compared the poller results on ma1 and ma2 to actual usage and says that they are reporting accurately.
My first question is of course why is Orion seeing the CPU at 100% on the first set of nodes and how can we fix this?
And secondly, the poller graph and gauge on the second set (cf1 and cf2) show no data - the gauge piece shows this message: No pollers assigned or no statistic results. Click edit button to change universal device poller assignment!
When I click edit, the correct poller is assigned.
Thanks in advance for any help!
klc2009
.