Use of copy Stage In PX

Sreedhar
Participant
Posts: 187
Joined: Mon Oct 30, 2006 12:16 am


Post by Sreedhar »

Hi,

I am using the Copy stage four times in one of my jobs to achieve the required functionality. Does this reduce the performance of the job?

If yes, which other stages can be used as alternatives to the Copy stage to improve performance?

If no, does it have any other implications for the job?
Regards,
Shree
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne

Post by vmcburney »

If you want to use reference data more than once, or use different columns from the same table/file in more than one lookup, the Copy stage is excellent for performance. The source data is converted into native parallel data just once and then used in different places in the job.
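
As a rough illustration of that "read once, use in several places" idea, here is a minimal Python sketch. It is not DataStage code: `CountingSource` and `copy_stage` are hypothetical names made up for this example, and the sample rows are invented. It only models one input being fanned out to two lookups that each keep different columns.

```python
from typing import Dict, List, Sequence

Row = Dict[str, object]


class CountingSource:
    """Hypothetical stand-in for a reference table or file; counts how often it is read."""

    def __init__(self, rows: List[Row]) -> None:
        self._rows = rows
        self.reads = 0

    def read(self) -> List[Row]:
        self.reads += 1
        return [dict(row) for row in self._rows]


def copy_stage(rows: List[Row], column_sets: Sequence[Sequence[str]]) -> List[List[Row]]:
    """Fan one input out to several outputs, each keeping only the listed columns."""
    return [[{col: row[col] for col in cols} for row in rows] for cols in column_sets]


source = CountingSource([
    {"cust_id": 1, "name": "Ann", "region": "EU"},
    {"cust_id": 2, "name": "Bob", "region": "US"},
])

# Read the reference data once, then feed two lookups from the same rows.
names, regions = copy_stage(source.read(), [("cust_id", "name"), ("cust_id", "region")])
name_lookup = {row["cust_id"]: row["name"] for row in names}
region_lookup = {row["cust_id"]: row["region"] for row in regions}
```

Both lookups are built from a single call to `source.read()`; without the fan-out, each lookup would have to read and convert the source again.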
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Copy is the most lightweight of the stages that can do what it does.

However, a fundamental rule of computer processing, irrespective of what tool or program you are using, is that every unit of work you add consumes some CPU cycles (and probably other resources too).

So, while the Copy stage itself may not consume much in the way of resources, the extra work you are doing on the three new streams may.
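
To put rough numbers on that, here is a toy Python cost model. It is an assumption-laden sketch, not a measurement of DataStage: `count_work` is a hypothetical helper, and it simply assumes the copy costs one row hand-off per output stream while each downstream stream does a fixed number of operations per row.

```python
from typing import Dict


def count_work(num_rows: int, num_streams: int, downstream_ops_per_row: int) -> Dict[str, int]:
    """Count abstract units of work in a toy model of a fan-out followed by processing."""
    # Assumed cost of the copy itself: one hand-off per row per output stream.
    copy_ops = num_rows * num_streams
    # Assumed cost of whatever is done on each stream afterwards.
    downstream_ops = num_rows * num_streams * downstream_ops_per_row
    return {"copy": copy_ops, "downstream": downstream_ops}


work = count_work(num_rows=1000, num_streams=4, downstream_ops_per_row=5)
```

Under these assumptions, the downstream work is `downstream_ops_per_row` times the cost of the copy, so the place to look for savings is what each stream does, not the copy.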

This is not a DataStage problem. The problem here lies between the keyboard and the chair.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.