Background
IT-Conductor can automate the whole process of job abort detection, restarting SAP failed jobs, and notifying the appropriate job owner, including the job log as an attachment. In advanced cases, IT-Conductor can even restart the job with a specific variant, and/or from a specific step. Depending on the complexity of the conditions on how you want to restart a particular job, IT-Conductor can be configured to execute this process to reduce the MTTR (Mean Time to Repair).
Prerequisites
On your SAP environment, create a dedicated SAP service user to monitor and execute the jobs
On the IT-Conductor main menu, navigate to Support → Downloads → SAP Security Downloads, and download the SAP NW Batch Scheduling Role
Assign this role to the recently created job monitoring SAP user using PFCG transaction code
Navigate to a system where you’ll be creating batch jobs and select “Accounts” on the main menu
Create an automation account on IT-Conductor and associate it with the previously created SAP account. Give the user a descriptive name.
Create threshold override for job restart
You may create a threshold override from a template. IT-Conductor has templates for all metrics. In this case, since we want to restart a job after it’s failed or it’s been aborted, we’re going to navigate to the existing overrides for this metric.
Navigate to System → Background jobs → Aborted → Threshold override
Click on Create Override from template
Click on the template to create a new override
Click on Save when you’re done
Create a recovery activity to restart the job
A recovery activity it’s an option that allows you to automatically take action whenever an incident occurs in IT-Conductor.
Click back to the recently created Threshold override and scroll down to the “Recovery” section.
To turn on the recovery activity, select “Warning”, or “Alarm” on the “Recovery on” option
If you select “Warning”, the recovery activity will run when the “Warning Value” field = 1
If you select “Alarm”, the recovery activity will run when “Alarm Value” field = 2
Select a recovery activity from the “Recovery” list. In this case, we’re going to select the activity for Copy and Start Job.
Select the previously created automation user as “Owner”
Check the “Alert” box if you want to be alerted whenever this recovery activity occurs
Save Recovery Activity
0 Comments