This legacy SAM application monitor template contains performance & statistics counters for monitoring batch account.
Note: This template was deprecated in SAM 2020.2 and is no longer included in fresh installations of SAM. Use the Microsoft Azure Batch Account API poller template instead. If you're new to API pollers, watch API Pollers: When SNMP Won't Cut It.
Prerequisites:
- PowerShell module (Azure, AzureRM) must be installed prior to running this template
- To connect with Azure account the following parameters are required: SubscriptionID, ApplicationID, TenantID, Secret Key. Note: Any Azure App (with its name & ID) having minimum role as 'Read Only'.
- Batch Account name for which metrics must be calculated.
- PowerShell version 5.0 or later.
Credentials:
Login credential to access Azure Portal. This has to be passed as script arguments per prerequisites 2 above. e.g. <subscriptionID>, <TenantID>, <ApplicationID>, value=<Secret Key>, < Batch Account Name>
Optional Parameters which can be passed to script argument:
- value=<Time Interval> - for which data must be fetched (in mins). Default is 10 mins.
- No. of retries as the second last argument (Default is 3)
- WaitTime as the last argument (Default is 0.5 seconds)
Notes:
- To install Azure module, execute below PowerShell commands on the machine where template would be running against.
Install-Module -Name Azure.
Install-Module -Name AzureRM.
- Windows Administrator on the machine where template would be running against. Script should run with administration privilege
- The ApplicationID with which you are making a connection to Azure portal (as mentioned in Credential/Prerequisites) must be registered in Azure Active Directory as contributor role for the monitored application.
- This template was tested with AzureRM Module 6.9.0
Portions of this document were originally created by and are excerpted from the following sources:
https://docs.microsoft.com/en-us/azure/monitoring-and-diagnostics/monitoring-supported-metrics
MONITORED COMPONENTS
Dedicated number of cores in the batch account
This component monitors the total number of dedicated cores in the batch account.
Unit: Count
Number of nodes created
This component monitors the total number of dedicated nodes in the batch account.
Unit: Count
Number of idle nodes
This component monitors the number of idle nodes.
Unit: Count
Number of nodes leaving the Pool
This component monitors the number of nodes leaving the Pool.
Unit: Count
Number of offline nodes
This component monitors the number of offline nodes.
Unit: Count
Total number of pools that have been created
This component monitors the total number of pools that have been created.
Unit: Count
Total number of pool deletes that have completed
This component monitors the total number of pool deletes that have completed.
Unit: Count
Total number of pool deletes that have started
This component monitors the total number of pool deletes that have started.
Unit: Count
Total number of pool resizes that have completed
This component monitors the total number of pool resizes that have completed.
Unit: Count
Total number of pool resizes that have started
This component monitors the total number of pool resizes that have started.
Unit: Count
Count of rebooting nodes
This component monitors the number of rebooting nodes.
Unit: Count
Count of reimaging nodes
This component monitors the total number of reimaging nodes.
Unit: Count.
Count of running nodes
This component monitors the number of running nodes.
Unit: Count
Count of nodes where the Start Task has failed
This component monitors the number of nodes where the Start Task has failed.
Unit: Count
Count of nodes starting
This component monitors total number of nodes that are starting.
Unit: Count
Total count of tasks that have completed
This component monitors the total number of tasks that have completed.
Unit: Count
Total count of tasks that have completed in a failed state
This component monitors the total number of tasks that have completed in a failed state.
Unit: Count
Total count of tasks that have started
This component monitors the total number of tasks that have started.
Unit: Count
Average count of nodes in the batch account
This component monitors the average number of nodes.
Average count takes following into consideration.
- Offline Node Count
- Idle Node Count
- Running Node Count
- Unusable Node Count
Unit: Count
Count of unusable nodes
This component monitors the number of unusable nodes.
Unit: Count
Count of nodes waiting for the Start Task to complete
This component monitors the number of nodes waiting for the Start Task to complete.
Unit: Count
Average number of cores in the batch account
This component monitors the average number of dedicated cores in the batch account.
Unit: Count
Troubleshooting Steps:-
Issue: - Application monitor or test is failing with unknown error.
Solution: - If test or application is failing with error “unknown status”, try following steps
Go To you do have to EDIT SCRIPT.
Check script output manually by clicking “GET SCRIPT OUTPUT”.
Provide node.
Provide credentials.
- Save.
Now rerun test.