Page 1 of 1

CPU AND LINK PARTIONERS

Posted: Fri Mar 25, 2005 11:01 am
by kollurianu
Hi All,

Can please shed me some light on like how many partions ( link partioner) are good to get good results with 2 cpus


thank you

Posted: Fri Mar 25, 2005 4:24 pm
by ray.wurlod
Not more than four, and that may be reduced by whatever else is happening in the job, particularly between the link partitioner and link collector stages.

Posted: Sun Mar 27, 2005 4:24 am
by roy
Hi,
naturally you must also concider the machine load at the time you want to run the job.
bare in mind 1 process for the link partitioner and 1 for the collector making 2, plus depending on your design x processes where x stands for the number of links you split it to; making a minimum total of 2 + x processes.
as Ray said it depend on what you implement before,between and after the link partitioner/colector stages.

IHTH,

Posted: Wed Mar 30, 2005 12:51 pm
by kollurianu
still iam not clear as to how many partions i need to use for link partioner

if i have to 2 cpus and how to determine this number on what basis.

thank you all

Posted: Wed Mar 30, 2005 2:38 pm
by chucksmith
When your job completes, what % CPU does the job monitor in DataStage Director show?

Let's say it says 50%. Since you have 2 CPUs, you have 200% available.

If your job is the only job running, then 3 or 4 partitions would be possible.

That is:

Code: Select all

200 % available CPU
---------------------------------- = 4 partitions
50 % CPU used by 1 partition
If the CPU statistics are not available in the job monitor, you can do a similar calculation based upon the sum of all CPU times from the finishing records in the job log divided by the elapse run time of the job.

Posted: Wed Mar 30, 2005 2:52 pm
by kollurianu
But when you develop a job in the development environment and
then some time you run on production environment , so how do determine
how partitions are good for optimal performance for that job.

thank you very much

Posted: Wed Mar 30, 2005 3:20 pm
by mhester
I believe the answer to your question is not as simple as some algorithm or formula that can be given to you here. I'm not sure Ascential publishes such information in a definitive way. My experience has always been that there is a point of diminishing return, meaning that at some point the overall performance of the job will suffer as you add more links/processes. I have found the types of transformations (if any) that happen between the partitioner and collector make a difference as well as the source and target.

I have found on a Wintel box with 8 procs (in our configuration) that 4 streams worked very well while 5 caused a significant slow down. I would think this would be different on Unix and have witnessed this to be true.

You also have to be concerned with what other processes are running on the box like Oracle, SQL Server, or other applications. These will and does have a direct impact on how processing takes place and will also help dictate how to partition the data.

I suggest you add links till it hurts and then back off and namely you should play around with different configurations to find the one that is best suited to your environment.

Regards,

Posted: Wed Mar 30, 2005 3:50 pm
by kollurianu
well, thanks for all ur inputs , this looks like variable performance , on
production environment u can never when u need to run the job and at that time how the job is going to perform , depending on the cpu availability.

Can any one shed me light on how link partioners and multiinstance job are related ,

exactly how does multiinstance job work and in which scenarios it is used.


Thank you all once again.

Posted: Wed Mar 30, 2005 4:00 pm
by chucksmith
They can provide you with similar function. However, with multi-instance jobs, you must be able to partition your input, and ensure you do not have any contention issues with your outputs. My opinion is that partitioner/collector pairs give you more control over the parts of a job that you parallelize.

Still, efficient routines and derivation should be your first concern.

Posted: Wed Mar 30, 2005 4:20 pm
by kollurianu
which one is better to user multiinstance or linkpartitioners and collectors

thank you all once again