Metrics install automatically if you have Redgate Monitor installed.
If you are using Redgate’s SQL Server monitoring tool, Redgate Monitor, you can instantly install and run this metric on your servers.
This metric returns a count of the job steps that failed in the latest execution of all jobs on the instance. This should be a calculated measure, where the change is what should alert the DBA. If any step in a job fails in the latest execution, this alert counts those failures and returns a value. This means if a job step fails in a job, and the job still reports success, this value changes.
If the job re-runs, and no steps error out, then this metric would report a 0.
Metric definition
Name
Finding Job Step Failures
Description
This metric returns a count of the job steps that failed in the latest execution of all jobs on the instance. This should be a calculated measure, where the change is what should alert the DBA. If any step in a job fails in the latest execution, this alert counts those failures and returns a value. This means if a job step fails in a job, and the job still reports success, this value changes. If the job re-runs, and no steps error out, then this metric would report a 0.
The T-SQL query that will collect data
Instances to collect from
all
Databases to collect from
msdb
Collection frequency
every hour
Use collected or calculated values
Leave the Use a calculated rate of change between collections check box unchecked
Metric collection
ENABLED
Alert definition
Alert name
Job Steps Have Failed
Description
Raise an alert when the metric value goes
above the defined threshholds
Default threshold values
High: | |
Medium: | |
Low: | 1 |
Raise an alert when the threshold is passed for
1 occurrence collections
Alert is
ENABLED