In a few instances, I have noticed CPU/memory polling has stopped for a node. I don't know why it happens, but it is a simple fix -- remove the resources from polling, add them back and it starts polling again. This has happened three or four times and I found it each time when I was looking at a specific node.
However, I don't look at every single node with regularity. So, there could be more nodes out there that have stopped polling. What I want is a report that would tell me if a node has stopped compiling these stats. Otherwise, I will only find it when I look for it, which means I will only find it when I need it.
I have found reports for reporting cpu/memory but to my knowledge, these reports will use the last measured value, which means if it has stopped compiling stats, it will just use the last one it measured.
Anyone have an idea of how I could accomplish this?