How to improve performance..


Das
Participant
Posts: 87
Joined: Tue Oct 24, 2006 9:58 pm
Location: india

How to improve performance..

Post by Das »

Hi All,
Please advise: which is the best way to use these stages to get optimum performance?

1. Currently I am using the Remove Duplicates stage to remove duplicate records, but I think using the Sort stage would be a better way to do the same operation.
Can I achieve any performance improvement from that? Please advise.

2. Transformer and Filter stages: currently I am using a Filter stage. If I replace it with a Transformer (using DS 8.x), can I expect any performance improvement? Please advise.

3. During my loading process I am creating N datasets. The same datasets get overwritten in the next run. The question here is: can I expect any performance improvement if I delete the datasets using the orchadmin delete command after each run, instead of overwriting them through the job? Please give me your comments/suggestions.

4. In most of the jobs I am creating 10-20 datasets in a final Transformer, and there is also a reject output after a Lookup. Now the process looks very slow. Please advise.
gssr
Participant
Posts: 243
Joined: Fri Jan 09, 2009 12:51 am
Location: India

Post by gssr »

You can find some tutorials and white papers on this issue with your favorite search engine. :arrow:
RAJ
Das
Participant
Posts: 87
Joined: Tue Oct 24, 2006 9:58 pm
Location: india

Post by Das »

gssr wrote: You can find some tutorials and white papers on this issue with your favorite search engine. :arrow:
Okay, but I want to hear from real DataStage gurus about these specific issues.
kiran259
Participant
Posts: 48
Joined: Thu Aug 16, 2007 11:17 pm
Location: United States

Post by kiran259 »

First, how do you define performance? It depends on various factors such as hardware configuration, CPUs used, volume of source data, etc. Performance improvement is an iterative process of working toward the best possible solution.
- Transformer would be better than Filter, for various reasons.
Moreover, without knowing your requirements, no conclusions can be drawn.

Do you have an interview soon? :wink:
Kiran Vaduguri

As soon as the fear approaches near, attack and destroy it.
Das
Participant
Posts: 87
Joined: Tue Oct 24, 2006 9:58 pm
Location: india

Post by Das »

kiran259 wrote: First, how do you define performance? It depends on various factors such as hardware configuration, CPUs used, volume of source data, etc. Performance improvement is an iterative process of working toward the best possible solution.
- Transformer would be better than Filter, for various reasons.
Moreover, without knowing your requirements, no conclusions can be drawn.

Do you have an interview soon? :wink:
The things I have mentioned here are facts about the jobs in my project; the issues listed are ones we have actually identified.
My environment is a 4-node MPP system, and the source data is around 50k records every 30 minutes. Right now the entire process takes 1.2 hours, and I need to reduce it to 30 minutes. The OS is SunOS 5.10.

If you can comment on this, it would be better for you too. Maybe it will help you in interviews as well. :)
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

1. Try it, let us know.

2. No.

3. No.

4. Identify the bottleneck. Based on such limited information I suggest the lookup.
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Before you do or worry about anything else, define "performance" in an ETL context. To my mind, the metric of performance is unlikely to be any kind of rate (particularly not rows per second). The main driver of ETL effort should - nay, must - be the ability to perform extraction and load phases within the time windows available, with a margin for safety.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
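
Editor's sketch: the "fit the time window, with a margin for safety" test from the post above can be written as a small check. This is only an illustration in Python, not a DataStage feature. The 72-minute runtime (1.2 hours) and the 30-minute target come from the thread; the function names and the 20% safety margin are assumptions made for the example.

```python
def fits_window(runtime_min: float, window_min: float, margin: float = 0.2) -> bool:
    """Return True if the run finishes inside the window after reserving a safety margin.

    margin is the fraction of the window held back as headroom (assumed 20% here).
    """
    if not 0 <= margin < 1:
        raise ValueError("margin must be in [0, 1)")
    return runtime_min <= window_min * (1 - margin)


def required_speedup(runtime_min: float, window_min: float, margin: float = 0.2) -> float:
    """Factor by which the run must get faster to fit the window with the given margin."""
    if not 0 <= margin < 1:
        raise ValueError("margin must be in [0, 1)")
    return runtime_min / (window_min * (1 - margin))


if __name__ == "__main__":
    runtime = 72.0  # current end-to-end time from the thread: 1.2 hours
    window = 30.0   # target from the thread: 30 minutes
    print("fits window:", fits_window(runtime, window))
    print("speedup needed:", round(required_speedup(runtime, window), 2))
```

With no margin the job would need to run 2.4 times faster (72 / 30); reserving the assumed 20% headroom raises that to roughly 3 times. The result is a pass/fail against the window rather than a rows-per-second figure.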