Hi,
In a job design to achieve a particular transformation, which approach is better from a performance tuning point of view: using 3 Transformer stages or 20 stage variables (source, target and data volume are the same)?
Which job will take less time? It's not an interview question; I am trying to do performance tuning.
Hi,
Sorry, I made a mistake in my post.
My question is: is it better to implement the job logic using 3 Transformer stages back to back, or 20 stage variables in a single Transformer (source, target and data volume are the same)?
What else are the Transformer stages doing? Do you have inter-process row buffering enabled? When you watch this job in the Monitor, how much CPU is each Transformer stage using? Are they operating in one process or in separate processes?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Roopanwita wrote:Hey! Thanks for your reply!
How do I see the CPU usage for a particular stage?
In Director, right-click on the job and choose Monitor. Read through the columns that appear; if you don't see %CP, right-click on the column headings and choose it. Alternatively, you can set the APT_PM_PLAYER_TIMING environment variable to see the CPU usage for each operator.
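For example, on a Unix engine tier you could export the variable before running the job (it can equally be set as a project-level variable in Administrator or as a job parameter); this is a minimal sketch, not the only way to set it:

```shell
# Enable per-operator CPU timing messages in the parallel job log.
# APT_PM_PLAYER_TIMING is a standard parallel-engine reporting variable;
# set it in dsenv, in Administrator, or as a job parameter.
export APT_PM_PLAYER_TIMING=True
```

With this set, each player process logs its CPU time, so you can see which Transformer is actually doing the work.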
It took me fifteen years to discover I had no talent for ETL, but I couldn't give it up because by that time I was too famous.
Would combine operators merge the three Transformers into one? I would guess that a single Transformer would work best but the easiest way to tell is to design the job both ways and run a test. The one time I would use two or more Transformers in a row is if the first Transformer has a constraint that removes a lot of rows to lighten the load on the subsequent Transformers.
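One way to check whether operator combination is a factor in the comparison: run the same job with combination disabled, so each Transformer runs as its own process and appears separately in the Monitor. A hedged sketch, assuming the standard APT_DISABLE_COMBINATION variable is available in your release:

```shell
# Force each parallel operator into its own process so the three
# Transformers can be timed individually in the Monitor.
export APT_DISABLE_COMBINATION=True
# Then run the job (via Director or dsjob) and compare elapsed time
# and per-stage %CP against a baseline run with the variable unset.
```

If the three-Transformer design only performs comparably when the operators are combined into one process, that is a point in favour of the single Transformer with stage variables.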