As bmrad pointed out in the Beta 1 Post, we've been working really hard to extend the integration with NPM and SAM introduced in Virtualization Manager 6.0. The team has been hyper-focused on simplifying configuration of the integration in order to bring you App-aware infrastructure monitoring, while preserving your flexibility to start with the tool you want (e.g. SAM or VMAN) and leverage the integration in the places which make the most sense. The features I'm going to outline here come directly from you and what you've told us matter most to detecting and remediating problems quickly in your virtualized environment.
I'm not going to go into elaborate detail here about the Sync Wizard, as bmrad did an excellent job of that in the Beta 1 Post. However, I did want to thank all of our Beta participants for giving us great feedback on the usability of the Wizard workflow. The Product Team completely understands that it doesn't matter how great the integration is, if you can't get the integration setup in the first place, none of the rest matters. With the feedback from Beta 1, we were able to streamline the messaging and workflow in the Wizard to get you up and running with the integration in minutes. We're not saying it's perfect (and definitely let us know where things don't make sense still!), but it should go a long way to making sure you never see broken integration resources ever again.
Baselines (Dynamic Thresholds) on Clusters, Hosts, VM's, and Datastores
As we discussed in the Beta 1 Post, the VMAN integration is now taking advantage of Dynamic Threshold, or Baselines, for thresholds and alerting purposes. An IT environment is a dynamic place, and when you add virtualized infrastructure to the mix, complexity leaps an order of magnitude. Isn't it about time that your alerting system recognized that fact? Well now it can!
So what can you set baseline threshold on? Our conversations with you determined the most important attributes across Datastores, VM's, Hosts, and Clusters for us to add to this release.. Obviously, we couldn't talk to everyone, and therefore are very interested in your feedback if there are other key metrics that would be valuable to baseline for your environment! In 6.1, you can set dynamic thresholds against the following virtual objects:
Given that, let's see what it might look like if I want to go set a CPU Load baseline threshold against Virtual Hosts in my environment. CPU Load is an important metric to measure on Hosts with Baselines. I may have Hosts that run heavily loaded all of the time and the VM's perform acceptably on those. Therefore, I may not care to use a static threshold like 80% Warning / 90% Critical, but instead just want my alerting system to tell me "when this host is under abnormally high load." So let's get started....
|What You'll Do||What You'll See|
|First step, of course, is to make sure that you've got the integration enabled. Simply go to Settings->Virtualization Settings->Enable Virtualization Manager Integration and enter the IP and credentials for your VMAN appliance. This will launch you directly into our new (VMAN 6.1) Synchronization Wizard. For more information on the Sync Wizard, reference the Beta 1 Blog Post.|
|Once the wizard is done syncing your environment, the integration is now setup and ready to go. You now will have access to all of the additional baseline goodness I mentioned above. So the first place to head to is back to the Settings page. On the main Settings page, you'll see a new sub-heading - Manage Virtual Devices. This was formerly the "Virtualization Polling Settings" menu option for VIM, but we've now extended it for setting Thresholds on your Virtual Devices and thus the name change.|
|Once you get to the Manage Virtual Devices page, select the Thresholds tab. This will reveal a dropdown menu where you can select virtual object types - VC's, Clusters, Hosts, VM's, Datastores - to view in the selection box below. You can also search here to further refine your selection. Given our example use case, I'm going to select Hosts here to view all the Virtual Hosts in my environment. This will show both VMware and Hyper-V hosts that are enabled by the integration (i.e. visible to VMAN and Orion).|
|Now that I've filtered the view to see all the Virtual Hosts in my environment, I can now multi-select the Hosts on which I want to set a Threshold. This might be useful if I want to set Static Threshold on some subset of my hosts and use automatic Dynamic Threshold for others. With multi-select I can do either quickly. Once I've selected all the Hosts I need, I click Edit Thresholds.|
Now we're into the heart of the matter. The "Edit Properties" screen presents me with several key pieces of information:
Our example use case involves setting CPU Load on these Hosts, so I'll select the CPU Load checkbox. This will reveal the current settings for that metric. In order to override the "Global Orion Threshold" with a custom value or baseline, select the checkbox next to "Override Global Orion Threshold or Set Dynamic Threshold."
As you can, see in the screenshot to the right, the current CPU Load thresholds are static thresholds:
We want to use Dyanmic Baselines for this metric. All I have to do in this case is select the Use Dynamic Baseline Threshold button and it will automatically set these Hosts to use Dynamic Thresholds for this metric.
Voila! We won't show an explicit value for these baseline thresholds, because they will be different for each node. If you want to see the baseline history (statistical data over time), you'll need to edit a single node at a time.
|So let's go check out one of the ESX Hosts we set the threshold on. If you look at the Host Details page, you can see that the Resource Utilization graph is showing Yellow and Red bars for the warning and critical thresholds we've now set. Oh BTW - that Resource Utilization sparkline chart? Also new in VMAN 6.1 for all vNodes!|
Advanced Alerting - Now with a Virtual Twist!
OK, so now that I've set my dynamic threshold, how do I actually get alerted? Well, with VMAN 6.1, you can now alert on the VMAN data presented in the integration. So I can use Orion's Advanced Alert Manager to set alerts just as I would any other Orion object. We've even included a subset of the standard VMAN alerts out-of-the-box. These alerts include:
|Cluster||Host||VM||Datastore/Cluster Shared Volume|
So continuing our example for above, let's take a dive in the Orion Advanced Alert Manager and setup an alert on our threshold:
Web-Based Reporting - Leverage the Newest Orion Core Feature on Your Virtual Data
So now you've seen how to set a dynamic threshold and alert on that, all in Orion, all on data collected by VMAN. Pretty cool, huh? Well the last piece is Orion Reporting. Orion has recently introduced Web-based Reporting, and I'm happy to say that the Web-based Reporting system can also now be used to report on the data presented in the VMAN integration. As a quick example, let's go ahead and quickly show you a report I created to show me all of the VM's in my environment with High CPU Load.
Account Limitations - Role-based Access Control, Orion Style
I'll wrap up by briefly discussing Account Limitations, also known as Role-based Access Control. This has been a longstanding requests from VMAN customers and we're very happy we are able to deliver it through the integration in this release by leveraging Orion's Account Limitations feature. This functionality has become more critical as folks use VMware and Hyper-V estates as Private Cloud infrastructure. User access can now be restricted under different level of the virtual "hierarchy." For example, if I have a large environment with multiple vCenters, I can limit users to see only the virtual objects - hosts, vm's, clusters, datacenters, - under a single vCenter. These limitations can now be applied in the following ways:
- View everything under a Single or Group of:
- View a single or group of VM's
- Datastores don't fit neatly in the above hierarchy, so they have their own control.
Note that the embedded Flex (Flash) views from VMAN shown in the Orion interface (e.g. the Map view) is all or nothing since the limitations are applied on the Orion side, not on the VMAN side. There's not much to show here, as Account Limitations just limit your view of the infrastructure, so it doesn't really make for interesting screenshots. Nonetheless, this should be a useful feature for our customers running these products in a private cloud deployment.