Finding Job Step Failures

If you are using Redgate’s SQL Server monitoring tool, Redgate Monitor, you can instantly install and run this metric on your servers.

This metric returns a count of the job steps that failed in the latest execution of each job on the instance. Treat it as a calculated measure: a change in the value is what should alert the DBA. If any step fails in a job's latest execution, the metric counts those failures and returns the total. This means that if a step fails and the job still reports success, the value still changes.

If the job re-runs and no step fails, the metric reports 0.
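
The behavior described above can be illustrated with a short T-SQL sketch. This is not the metric's published query. It is one possible way to produce such a count, and it rests on some assumptions about `msdb.dbo.sysjobhistory`: that `step_id = 0` marks the job outcome row, that `run_status = 0` means failed, and that `instance_id` increases in the order rows are written, so the step rows of a run fall between two consecutive outcome rows for the same job. The names `LastOutcome`, `PrevOutcome`, and `FailedSteps` are illustrative only.

```sql
-- Sketch only: not the metric's published query.
-- Assumes step_id = 0 is the job outcome row, run_status = 0 means failed,
-- and instance_id increases in the order history rows are written.
WITH LastOutcome AS (
    -- Most recent job outcome row for each job.
    SELECT job_id, MAX(instance_id) AS last_instance_id
    FROM msdb.dbo.sysjobhistory
    WHERE step_id = 0
    GROUP BY job_id
),
PrevOutcome AS (
    -- The outcome row before that one; the latest run's steps come after it.
    SELECT h.job_id, MAX(h.instance_id) AS prev_instance_id
    FROM msdb.dbo.sysjobhistory AS h
    INNER JOIN LastOutcome AS lo ON lo.job_id = h.job_id
    WHERE h.step_id = 0
      AND h.instance_id < lo.last_instance_id
    GROUP BY h.job_id
)
SELECT COUNT(*) AS FailedSteps
FROM msdb.dbo.sysjobhistory AS h
INNER JOIN LastOutcome AS lo ON lo.job_id = h.job_id
LEFT JOIN PrevOutcome AS po ON po.job_id = h.job_id
WHERE h.step_id > 0                                      -- step rows only
  AND h.run_status = 0                                   -- failed steps
  AND h.instance_id < lo.last_instance_id                -- before the latest outcome row
  AND h.instance_id > COALESCE(po.prev_instance_id, 0);  -- after the previous outcome row
```

The query returns a single integer, so a job whose latest completed run had one failed step contributes 1 even when the job's own outcome row reports success. A job that is still running has no outcome row for that run yet, so under these assumptions its steps are not counted until it finishes.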

Metric definition

Name

Finding Job Step Failures

Description

This metric returns a count of the job steps that failed in the latest execution of each job on the instance. Treat it as a calculated measure: a change in the value is what should alert the DBA. If any step fails in a job's latest execution, the metric counts those failures and returns the total. This means that if a step fails and the job still reports success, the value still changes. If the job re-runs and no step fails, the metric reports 0.

The T-SQL query that will collect data

Instances to collect from

all

Databases to collect from

msdb

Collection frequency

every hour

Use collected or calculated values

Leave the "Use a calculated rate of change between collections" check box unchecked

Metric collection

ENABLED

Alert definition

Alert name

Job Steps Have Failed

Description

A step in a job failed, whether or not the job itself reported success.

Raise an alert when the metric value goes

above the defined thresholds

Default threshold values

High:
Medium:
Low: 1

Raise an alert when the threshold is passed for

1 collection

Alert is

ENABLED