Using RCP in a job that has multiple join stages

Post questions here related to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

pdntsap
Premium Member
Posts: 107
Joined: Mon Jul 04, 2011 5:38 pm

Post by pdntsap »

Hello,

There is a parallel job that has multiple (4-5) Join stages. Each stage joins on about 10 key columns, but the inputs to the Join stages have anywhere from 70 to 120 columns. I am looking for ways to decrease the total run time of the job, and I am wondering whether using RCP would help. I have come across posts that say RCP does not work well with joins, and other posts that say RCP does not decrease the total run time of a job. Any thoughts to help me decide for or against RCP are greatly appreciated.

Thanks.
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

RCP has nothing to do with 'reducing the total run time' of a job; it's all about creating more 'generic' jobs.
-craig

"You can never have too many knives" -- Logan Nine Fingers
pdntsap
Premium Member
Posts: 107
Joined: Mon Jul 04, 2011 5:38 pm

Post by pdntsap »

Thanks Craig.
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

So, look closer at your job design, see if anything about it can be made more... optimal.
-craig