Home Page -
Series 1 -
Episode 1
OK, let's get started
Series 1 is all about KPIs and ... there is going to be plenty of them.
All these KPIs are state of the art charts with custom-made SQL driven queries. They are #1 go-to stats for any Technology Manager to see things through. They track the whole escalation flow, break-fix timing, SLAs, alerts rates, compliance, aging alerts, noise alerts and many other metrics that help you to understand how "busy" your support teams might be and how stable your infrastructure really is, over time.
Here is sneak-peak preview of how it might look like ... (resolution is deliberately low - my intention through this series is for you to build that dashboard for your own infrastructure):
All these KPIs are designed to be used as a widgets/resources on summary views. I personally have a dedicated view summary page and about 20 or 30 different widgets what is making it all up. I have further split this page into tabs to group them together by focus area and ensure page loads faster, with about 8-10 widgets per tab.
So, to get the most out of this series I recommend you to create a new summary view and start assembling your KPIs right there. I will share all my KPIs widgets with you - one per post. So, be sure to subscribe to updates on my Home Page to not miss them
Without further a do ... here is the first one
Take a look at the widget screenshot below. It sits on top of the first tab of my KPI view page and gives me an idea what's happening at a glance with volume of alerts compared to volume of events on one graph, pretty much live, right now, as we speak
- Left Y-AXIS - number of alert in a given minute (blue color)
- Right Y-AXIS - number of events in a given minute (purple color)
- Bottom X-AXIS - timeline
Notice that this window is 60 minutes. I deliberately have chosen such a small window to be able to see everything minute-by-minute, without summarizing any data into averages. When you hover over any bar - you can actually see exact numbers for that particular point in time (minute). Stay tuned and we will have more KPI charts coming that will have larger window for more broader overview.
How this is done
(1)
Add Custom Chart widget onto your summary view
(2)
Give it Title and Subtitle. I tend to maintain the following format for my subtitles to ensure consistency by specifying period and sample interval so that it is immediately obvious how wide is actual window (period) and how aggressively we summarize data (sample)
- Title: Alerts and Events Trigger Rate (minute-by-minute)
- Subtitle: Period: Last 60 Minutes | Sample: Once per minute
(3)
> Click on SELECT DATASOURCE
> select Advance method as Database Query
> select SQL radio button
> paste the following SQL script (source code attached)
> click on preview results to make sure data comes back as expected (should be aggregated into minute-by minute intervals)
> give a name for your selection (in my case I call it "Rates")
> click UPDATE DATA SOURCE
So far we have the following ...
(4)
Now, this is what took me quite a while to figure out and adjust SQL accordingly:
> Group data by: Legend
> Legend shows: Legend
> Set time period to 1 minute
(5)
Now, let's setup left Y-AXIS:
> Add data series to left Y-Axis
> Select Alerts > Add data series
> Click on more link to see additional settings
> Click on time column and select SAMPLE column
> Click on color drop-down and select color of your choice for alerts columns
(6)
Let's now repeat this for the right Y-AXIS column, but in this case you need to select Events as a data series and pick different color. All the rest steps should be the same as per section (5) above
(7)
And finally, very important last step:
> go to time period drop down and select Custom
> give it a name "Last 60 Minutes"
> select last 60 minutes
> ADD
(8)
So, we now have the following setup (as per full screenshot below). Now, when you click SUBMIT you should hopefully get your first KPI chart working
NOTE:
You may have an issue with timing, whereby you graph's last 60 minutes comes as blank (something like below). If that's the case - please post a comment below and I will explain how to deal with this issue. This is because of the very strange behavior with the way SolarWinds processes time which I still don't understand 100%, but I have a workaround for such issue
Enjoy
Remember to give it a kudos and subscribe for updates on my home page
With Gratitude,
Alex Soul