Maximum number of jobs per project...

A forum for discussing DataStage<sup>®</sup> basics. If you're not sure where your question goes, start here.

Moderators: chulett, rschirm, roy

soportesis
Premium Member
Premium Member
Posts: 21
Joined: Thu May 31, 2007 8:58 am

Maximum number of jobs per project...

Post by soportesis »

Hello,

I have a project in DataStage 7.5.2 EE on UNIX with more than 1000 jobs, but I'm not sure if having many jobs into a project is a best practice.
  • Are there some recommendation about of the maximum number of jobs per project?

    What risks are involved in a project that has many jobs?

    Do you know where can I learn more about this topic?
Tnks a lot.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

It depends to some extent upon what flavour of UNIX you are on, and on the limit (if any) on the number of subdirectories that can be created in a directory.

1000 jobs is probably OK - you can go up to somewhere near 4000. However, performance problems start kicking in in areas like job startup and job creation. They take longer than in a project with fewer jobs.

Another risk is the total volume of entries in job logs, which are all stored within the project directory - the danger is that of filling the disk that contains the project directory. Splitting the project into multiple projects on the same disk will not ameliorate this situation; splitting into multiple projects on separate disks definitely will.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
pandeesh
Premium Member
Premium Member
Posts: 1399
Joined: Sun Oct 24, 2010 5:15 am
Location: CHENNAI, TAMIL NADU

Post by pandeesh »

ray.wurlod wrote:It depends to some extent upon what flavour of UNIX you are on, and on the limit (if any)
What about SunOS?
is there any limit for that?
pandeeswaran
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

I believe the limit on Solaris generally to be 32K subdirectories per directory.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
soportesis
Premium Member
Premium Member
Posts: 21
Joined: Thu May 31, 2007 8:58 am

Post by soportesis »

Tnks Ray, it was very good information.
PhilHibbs
Premium Member
Premium Member
Posts: 1044
Joined: Wed Sep 29, 2004 3:30 am
Location: Nottingham, UK
Contact:

Post by PhilHibbs »

ray.wurlod wrote:I believe the limit on Solaris generally to be 32K subdirectories per directory.
And each job requires 6 directories, which gives the ~5000 job limit mentioned earlier.
Phil Hibbs | Capgemini
Technical Consultant
pandeesh
Premium Member
Premium Member
Posts: 1399
Joined: Sun Oct 24, 2010 5:15 am
Location: CHENNAI, TAMIL NADU

Post by pandeesh »

PhilHibbs wrote:And each job requires 6 directories, which gives the ~5000 job limit mentioned earlier.
What are those 6?
i remember RTSTATUS,RTCONFIG and RTLOG which is a hashed file.
Correct me if i am wrong.
And what will happen if we exceed the maximum limit?(While attempting to create the 5001th job if max limit is 5000)

Thanks
pandeeswaran
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

What are the six? Why not simply check and answer the question yourself?
-craig

"You can never have too many knives" -- Logan Nine Fingers
pandeesh
Premium Member
Premium Member
Posts: 1399
Joined: Sun Oct 24, 2010 5:15 am
Location: CHENNAI, TAMIL NADU

Post by pandeesh »

yes craig.

I am able to see the below inside project directory

1)RT_BP
2)RT_BP.O
3)RT_CONFIG(Hashed file)
4)RT_STATUS(Hashed file)
5)RT_LOG(Hashed file)

Am i missing anything?
Thanks
pandeeswaran
PhilHibbs
Premium Member
Premium Member
Posts: 1044
Joined: Wed Sep 29, 2004 3:30 am
Location: Nottingham, UK
Contact:

Post by PhilHibbs »

Er, it's possible that I got it wrong and that it's only 5 directories. Or, there may be a sixth possibility that doesn't always get created for all jobs. I'll see if I can dig up the thread where I learned about all this.
Phil Hibbs | Capgemini
Technical Consultant
pandeesh
Premium Member
Premium Member
Posts: 1399
Joined: Sun Oct 24, 2010 5:15 am
Location: CHENNAI, TAMIL NADU

Post by pandeesh »

PhilHibbs wrote:Er, it's possible that I got it wrong and that it's only 5 directories. Or, there may be a sixth possibility that doesn't always get created for all jobs. I'll see if I can dig up the thread where I learned about all this.
I am really curious to know in which scenario, the sixth will create?
if you get any link, please share with me.
Thanks
pandeeswaran
PhilHibbs
Premium Member
Premium Member
Posts: 1044
Joined: Wed Sep 29, 2004 3:30 am
Location: Nottingham, UK
Contact:

Post by PhilHibbs »

I found a couple of threads that mention this but none of them explain exactly what the 6 directories are.

e.g. viewtopic.php?t=103027, viewtopic.php?t=116443
Phil Hibbs | Capgemini
Technical Consultant
pandeesh
Premium Member
Premium Member
Posts: 1399
Joined: Sun Oct 24, 2010 5:15 am
Location: CHENNAI, TAMIL NADU

Post by pandeesh »

I guess DS_TEMP is the 6th one..but I don't know under which circumstances it will be created..correct me if I am wrong
Last edited by pandeesh on Mon Jul 25, 2011 10:45 pm, edited 1 time in total.
pandeeswaran
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

There could be as many as eight:
  • RT_BPnnn - source code from BASIC components

    RT_BPnnn.O - compiled BASIC components

    RT_CONFIGnnn - run-time configuration

    RT_LOGnnn - job logs

    RT_SCnnn - osh and C++ components

    RT_STATUSnnn - run-time status of jobs and resources

    DS_TEMPnnn - transient components during design

    RT_QSnnn - QualityStage components
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
pandeesh
Premium Member
Premium Member
Posts: 1399
Joined: Sun Oct 24, 2010 5:15 am
Location: CHENNAI, TAMIL NADU

Post by pandeesh »

Good info Ray!! generally it varies from 5 to 8.
From my understanding RT_SCnnn will be created only if the job conatins any shared containers(SC).RT_QSnnn will be created only if datastage jobs are integrated with Qualitystage.
Correct me if my understanding is incorrect.
Could you elaborate little bit about DS_TEMPnnn?Under which circumstances DS_TEMPnnn will be created?
Thanks
pandeeswaran
Post Reply