Background
IT-Conductor offers automation solution for handling the restart of SAP batch jobs when they fail. This covers the detection of aborted batch jobs, automatic restart of the failed jobs, and notification of the appropriate job owner, including the delivery of job log as an attachment. In advanced cases, IT-Conductor can also restart the job with a specific variant, and/or from a specific step. Depending on the complexity of the conditions on how you want to restart a particular job, IT-Conductor can be configured to execute this process to reduce the MTTR (Mean Time to Repair).
Prerequisites
On your SAP environment, create a dedicated SAP service user to monitor and execute the jobs
On the IT-Conductor main menu, navigate to Support → Downloads → SAP Security Downloads, and download the SAP NW Batch Scheduling Role
Assign this role to the recently created job monitoring SAP user using the PFCG transaction code
Navigate to a system where you’ll be creating batch jobs and select “Accounts” on the main menu
Create an automation account on IT-Conductor and associate it with the previously created SAP account. Give the user a descriptive name.
Create threshold override for job restart
You may create a threshold override from a template. IT-Conductor has templates for all metrics. In this case, since we want to restart a job after it’s failed or it’s been aborted, we’re going to navigate to the existing overrides for this metric.
Navigate to System → Background jobs → Aborted → Threshold override
Click on Create Override from template
Click on the template to create a new override
Click on Save when you’re done
Create a recovery activity to restart the job
A recovery activity it’s an option that allows you to automatically take action whenever an incident occurs in IT-Conductor. Recovery activities are predefined by IT-Conductor Support based on the required automation process or scenario.
Click back to the recently created Threshold override and scroll down to the “Recovery” section.
To turn on the recovery activity, select “Warning”, or “Alarm” on the “Recovery on” option
If you select “Warning”, the recovery activity will run when the Warning threshold is exceeded.
If you select “Alarm”, the recovery activity will run when the defined Alarm threshold is breached.
Select a recovery activity from the “Recovery” list. In this case, we’re going to select the activity for Copy and Start Job.
Select the previously created automation user as “Owner”
Check the “Alert” box if you want to be alerted whenever this recovery activity occurs
Save Recovery Activity
On the same Threshold Override template, if you wish to be notified when a job has failed, select either “Warning” or “Alarm” on the “Alert On” option
Example of a batch job recovery activity on IT-Conductor
0 Comments