Peformance of Transformer (with Stage Variables)

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
sswarup
Participant
Posts: 6
Joined: Fri May 28, 2004 6:11 am

Peformance of Transformer (with Stage Variables)

Post by sswarup »

Hi,
We are trying to identify the peformance bottlenecks in some of our jobs. In many cases, we have observed, that the Transformer stage is taking a long time, especially those that have Stage Variable defined.

Would we be better off by eliminating stage variables (if possible)?
chucksmith
Premium Member
Premium Member
Posts: 385
Joined: Wed Jun 16, 2004 12:43 pm
Location: Virginia, USA
Contact:

Post by chucksmith »

No. Stage variables are an extremely efficient way of implementing calculation that would otherwise be repetitive, and do not have the overhead of a routine call.

Be sure that your stage variables are being used that way. One can still code bad derivations in a stage variable, too.

Also, "a long time" could mean CPU bound, but it could also mean I/O bound.

Check all of the best practices when reviewing your job's performance.

Good luck.
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne
Contact:

Post by vmcburney »

Look for custom written routines used by those stage variables or within the derivations throughout the transformer as these can be bottlenecks if they are not well written.
Post Reply