• Management Pack:  SCOM 2016
  • MP Version:  1.0
  • Released:  10/19/2018
  • Publisher:  Microsoft

Agent processor utilization Monitor

  • ID:  Microsoft.SystemCenter.HealthService.SCOMpercentageCPUTimeMonitor
  • Description:  Monitor all agent processes to identify potential issues with the agent using too much processor time.
  • Target:  Health Service
  • Enabled:  On Essential Monitoring

Operational States

Name                            State    Description
Processor time over threshold   Error
Processor time under threshold  Success

Overridable Parameters

Parameter Name                                    Default Value  Description  Override
Frequency (seconds)                               321
Timeout (seconds)                                 300
Synchronization time                              00:00
Threshold                                         25
Number of consecutive samples for critical state  6
Number of consecutive samples for healthy state   3

Alert Details

Monitor State                          Message                                                                   Priority  Severity  Auto Resolution
Processor time over threshold (Error)  The Operations Manager agent processes are using too much processor time  Medium    Critical  Yes

Run As Profiles

Name
Default

Monitor Knowledgebase

Summary

This monitor calculates the total CPU utilization of the Operations Manager agent and its related processes, and generates an alert when CPU utilization exceeds a specified threshold for a specified number of consecutive samples.

The monitor’s underlying script locates and samples CPU utilization for the Operations Manager agent process (HealthService.exe), its child monitoring host processes (MonitoringHost.exe), and the child processes of those monitoring host processes (cscript.exe, PowerShell.exe, and so on). The script runs the calculation three times and outputs the average of the three consecutive samples, which the monitor then uses to determine a critical or healthy state.
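The state logic described above can be sketched as follows. This is a minimal illustration, not the monitor's actual script: the class name CpuMonitor and the method observe are hypothetical, the defaults are taken from the Overridable Parameters table, and the way the consecutive-sample counters reset is an assumption.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class CpuMonitor:
    """Hypothetical model of the monitor's state logic (not the real script)."""
    threshold: float = 25.0     # "Threshold" default
    critical_samples: int = 6   # consecutive samples for critical state
    healthy_samples: int = 3    # consecutive samples for healthy state
    state: str = "Success"
    _over: int = field(default=0, init=False, repr=False)
    _under: int = field(default=0, init=False, repr=False)

    def observe(self, readings):
        """Average one run's readings and update the health state."""
        value = mean(readings)  # the script outputs the average of three samples
        if value > self.threshold:
            self._over += 1
            self._under = 0     # assumption: a streak resets on the opposite result
        else:
            self._under += 1
            self._over = 0
        if self._over >= self.critical_samples:
            self.state = "Error"
        elif self._under >= self.healthy_samples:
            self.state = "Success"
        return self.state


monitor = CpuMonitor()
# Six consecutive runs above the threshold are needed to reach the Error state.
high = [monitor.observe([30.0, 28.0, 32.0]) for _ in range(6)]
# Three consecutive runs under the threshold return the monitor to Success.
low = [monitor.observe([5.0, 6.0, 4.0]) for _ in range(3)]
```

With the default values, a single spike does not change the state; only a sustained run of over-threshold averages does, which is why short bursts of agent activity do not raise an alert.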

Causes

Excessive CPU utilization by the Operations Manager agent processes may indicate that the agent or one of its underlying dependencies is not operating properly. If the agent and its dependencies are up to date, the agent is most likely overloaded on the monitored system. The load may be short-lived, caused by a recent change in the management group such as the deployment of a new management pack, or it may be sustained, in which case tuning may be required.

Resolutions

To ensure that the agent and its underlying dependencies are operating properly, check the following:

  • Verify that the most recent version of the Operations Manager agent is installed on the system.

  • Verify that the update for MSXML 6.0 provided in Knowledge Base article 968967 (http://go.microsoft.com/fwlink/?LinkId=181885) is installed.

  • If the system's operating system is Windows XP, Windows 2000 Server or Windows Server 2003, ensure that the system is running Windows Script Host 5.7 or later. Download locations for Windows Script Host 5.7 are listed at http://go.microsoft.com/fwlink/?LinkId=181884.

If the condition persists after those configurations are verified, then deeper investigation is required to understand what is driving CPU utilization. Investigate further using any combination of the following steps:

  • Review the recent history of agent processor utilization, workflow count, and module counts in the Agent Performance View. The agent processor utilization data shows whether the issue is recent or has been occurring for a longer period of time. The workflow and module count data indicate the workload that the various rules, monitors, and discoveries are putting on the agent. Compare this data against healthy agents for contrast.

  • Use a tool such as the Effective Configuration Viewer (http://go.microsoft.com/fwlink/?LinkId=182300) to understand the number of class instances discovered on the agent. More class instances can lead to higher workflow and module counts, which can result in more workload.

  • Using Performance Monitor, collect more detailed % Processor Time measurements from the Process object. This will give insight as to which processes are contributing the most significantly to overall processor utilization.

  • Review any recent management pack updates or changes to see if they correspond with the increase in CPU utilization.
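The Performance Monitor step above can be followed by a quick ranking of the collected counters. The sketch below assumes the data was exported to a PDH-style CSV file (timestamp in the first column, one counter path per remaining column). The function name rank_processes, the host name HOST01, and the sample values are made up for illustration.

```python
import csv
import io
from statistics import mean

# Illustrative sample only: hypothetical host name and made-up values.
SAMPLE = r'''"(PDH-CSV 4.0) (UTC)(0)","\\HOST01\Process(HealthService)\% Processor Time","\\HOST01\Process(MonitoringHost)\% Processor Time"
"01/01/2024 00:00:00.000","2.0","10.0"
"01/01/2024 00:00:15.000","4.0","30.0"
'''


def rank_processes(csv_text):
    """Return (counter_path, average_value) pairs, highest average first."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    averages = []
    for col in range(1, len(header)):   # column 0 holds the timestamp
        values = []
        for row in data:
            try:
                values.append(float(row[col]))
            except (ValueError, IndexError):
                continue                # skip blank or missing samples
        if values:
            averages.append((header[col], mean(values)))
    return sorted(averages, key=lambda item: item[1], reverse=True)


ranked = rank_processes(SAMPLE)
for counter, average in ranked:
    print(f"{average:6.1f}  {counter}")
```

Sorting by average rather than by peak favors processes with sustained load, which matches how the monitor itself reacts only to consecutive over-threshold samples.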

When the cause or causes are identified, any one of the following steps may be taken to address the issue:

  • If a management pack change was made recently or a new management pack was deployed, monitor the situation to see if the problem continues.

  • Reduce the frequency of discoveries via overrides to spread out their CPU utilization across the day. Doing this comes at the trade-off of discovery potentially taking longer to occur.

  • Reduce the frequency of rules or monitors that run on a schedule to spread their CPU utilization across the day. The trade-off is that conditions are checked less often, so problems may be detected later.

  • If the agent is managed by multiple management groups (a configuration referred to as “multi-homed”), each additional management group adds to processor utilization. Consider reducing the number of management groups that manage the agent.

If none of the steps above resolves the issue, contact Microsoft Customer Service and Support (http://support.microsoft.com/).

External References
This monitor does not contain any external references.
