How to schedule a job to run every 5 minutes

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
rajeshknl
Participant
Posts: 22
Joined: Thu Jul 17, 2008 8:09 pm

How to schedule a job to run every 5 minutes

Post by rajeshknl »

Hello

I receive a file every 5 minutes (24x7). They are all of the same file type and are processed by the same job. The sizes of the files may vary; for example, file1 might be 10MB and file2 2MB.

The issue is that I don't want to stage the files and run them one after another, waiting for the previous file to be processed completely. I want to process each file as soon as I receive it.

I know the concept of multiple instances, but how do I use it? That is, how do I pass different filenames to different instances as parameters?

How do I keep track of which file is being processed, waiting, etc.?

I also want to limit the maximum number of instances to, say, 20 so that my box does not crash.

Any ideas/experiences/references are welcome.
priyadarshikunal
Premium Member
Posts: 1735
Joined: Thu Mar 01, 2007 5:44 am
Location: Troy, MI

Re: How to schedule a job to run every 5 minutes

Post by priyadarshikunal »

rajeshknl wrote:Hello

I receive a file every 5 minutes (24x7). They are all of the same file type and are processed by the same job. The sizes of the files may vary; for example, file1 might be 10MB and file2 2MB.

The issue is that I don't want to stage the files and run them one after another, waiting for the previous file to be processed completely. I want to process each file as soon as I receive it.

I know the concept of multiple instances, but how do I use it? That is, how do I pass different filenames to different instances as parameters?

How do I keep track of which file is being processed, waiting, etc.?

I also want to limit the maximum number of instances to, say, 20 so that my box does not crash.

Any ideas/experiences/references are welcome.

There are a few contradictions in what you want, but I hope I am answering it correctly.

What you need to do is:

1. Allow multiple instances for that job.
2. Check for a new file in your directory every five minutes. You may store the timestamp of the previous file and compare it with that of the latest one.
3. When you get a new file, pass its name as a parameter to the job.
4. The invocation id for each run needs to be unique and your instances should be dynamic, so you can use the current date and time as the invocation id.
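Step 4 above might look something like this in shell; the project and job names are placeholders, and the timestamp format is just one choice:

```shell
# Derive a unique invocation id from the current date and time, so each
# run of the multi-instance job gets its own instance.
invocationid=$(date +%Y%m%d%H%M%S)
echo "invocation id: $invocationid"
# dsjob -run MyProject "MyJob.$invocationid"    # the actual launch
```

Seconds-level resolution is enough here because at most one file arrives every five minutes.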

Also, since you don't want more than 20 instances, you can count the number of instances currently running with the command

Code: Select all

ps -fu <username> | grep <jobname> | grep -v grep | wc -l
If the count is 20 or more, hand the command to another script block that checks the number of running jobs every minute and, once the count falls, runs your command.

This is the best way I can think of right now (I will think about other aspects of this). Please correct me if I misunderstood your requirement.
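That "script block" idea could be sketched as below; DSUSER, PROJECT and JOBNAME are hypothetical placeholders for your environment:

```shell
# Throttle sketch: poll until fewer than MAX instances are running,
# then fire the job with the given invocation id.
MAX=20

# count live instances; grep -v grep keeps the pipeline itself out of the count
job_count() {
    ps -fu "$DSUSER" | grep "$JOBNAME" | grep -v grep | wc -l
}

throttle_and_run() {            # $1 = unique invocation id
    while [ "$(job_count)" -ge "$MAX" ]; do
        sleep 60                # re-check every minute, as described above
    done
    dsjob -run "$PROJECT" "$JOBNAME.$1"
}
```

One caveat with counting via `ps`/`grep`: a job name that is a substring of another job name will inflate the count, so the pattern may need anchoring in practice.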
Priyadarshi Kunal

Genius may have its limitations, but stupidity is not thus handicapped. :wink:
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

This is a good case for writing specialized job control code.

Keep an array of job names currently executing, and a similar array of job handles.

You can discover all the information you need from these arrays.

Also keep a scalar variable recording the number of jobs currently running, and don't start any more until one (at least) finishes.

Remove the jobs from the arrays once you've finished any post-execution processing.
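Ray's approach would normally be DataStage BASIC job control (DSAttachJob/DSRunJob and friends), but the bookkeeping he describes can be sketched as a rough shell analogue. All names and structure below are my illustration, not Ray's actual code; the single list of invocation ids stands in for both arrays he mentions:

```shell
# Bookkeeping sketch: a list of currently running invocation ids plus a
# derived running count, capped at MAX concurrent instances.
MAX=20
running=""                      # space-separated invocation ids

count_running() {
    set -- $running             # word-split the list into positional args
    echo $#
}

start_job() {                   # $1 = invocation id
    if [ "$(count_running)" -lt "$MAX" ]; then
        running="$running $1"
        # dsjob -run "$PROJECT" "$JOBNAME.$1"   # actual launch (placeholder)
        return 0
    fi
    return 1                    # at capacity: caller must wait and retry
}

finish_job() {                  # $1 = invocation id, after post-processing
    new=""
    for id in $running; do
        [ "$id" = "$1" ] || new="$new $id"
    done
    running=$new
}
```

A controller loop would call start_job when a new file arrives, retry while it returns non-zero, and call finish_job only after any post-execution processing, as Ray describes.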
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
AmeyJoshi14
Participant
Posts: 334
Joined: Fri Dec 01, 2006 5:17 am
Location: Texas

Post by AmeyJoshi14 »

Hi,
As per priyadarshikunal's post, that is, the points mentioned in it, I have created a script which might help you solve your problem. :lol:
The script is:

Code: Select all

#!/bin/ksh
DIRPATH=/path/of/the/directory
#Assuming the directory is empty or running for the first time
cntprev=1
#This script will run continuously
while true
do
  cntorg=`ls -ltr "$DIRPATH" | wc -l`  #taking the count
  cntnew=`expr $cntorg - 1`
  #Now comparing the value
  if [ $cntnew -ne $cntprev ]
  then
    numjobs=`ps -fu <username> | grep <jobname> | grep -v grep | wc -l`
    #To check that not more than 20 instances are running
    if [ $numjobs -lt 20 ]
    then
      invoctid=$cntnew
      . $DSHOME/dsenv
      cd $DSHOME/bin
      ./dsjob -run Project_name Job_Name.$invoctid 2>/dev/null
    fi
  fi
  cntprev=$cntnew
  sleep 60
done
http://findingjobsindatastage.blogspot.com/
Theory is when you know all and nothing works. Practice is when all works and nobody knows why. In this case we have put together theory and practice: nothing works. and nobody knows why! (Albert Einstein)
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

I would stick with the approach Ray mentioned; I have implemented the very same myself, or at least something very similar, using the techniques mentioned.
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

AmeyJoshi, why do you discard stderr from the dsjob command? I would expect to have this information available in the event of failure.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
AmeyJoshi14
Participant
Posts: 334
Joined: Fri Dec 01, 2006 5:17 am
Location: Texas

Post by AmeyJoshi14 »

ray.wurlod wrote:AmeyJoshi, why do you discard stderr from the dsjob command? I would expect to have this information available in the event of failure. ...
Hi,
Since this script runs continuously, every successful run would show 'status code = 0', which is why I discarded stderr. I had not thought of this option.. :oops:
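For reference, one way to keep that information available, as Ray suggests, is to append both streams to a log file instead of discarding stderr. The log file name and the wrapper function here are just illustrations:

```shell
# Append each run's stdout and stderr to a log so failures remain
# diagnosable instead of vanishing into /dev/null.
LOGFILE=/tmp/dsjob_runs.log     # illustrative location

run_with_log() {                # $1 = invocation id
    dsjob -run "$PROJECT" "$JOBNAME.$1" >>"$LOGFILE" 2>&1
}
```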

Craig Guruji: I just thought of expanding priyadarshikunal's idea a bit :( ......that's why I posted it, nothing else :) and I also think Ray Guruji's option is good, in fact the best.. :)
http://findingjobsindatastage.blogspot.com/
Theory is when you know all and nothing works. Practice is when all works and nobody knows why. In this case we have put together theory and practice: nothing works. and nobody knows why! (Albert Einstein)