Extraction Job which one is faster?

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
williamswe
Participant
Posts: 11
Joined: Mon Dec 05, 2005 4:11 am

Extraction Job which one is faster?

Post by williamswe »

Hi all,

I am extracting aggregated data from the warehouse using oracle sql ( invovled joining dimension and fact tables , summing up some measures, some translation of data column and sorting )
Now i need to build the DS job for it.

-should i get better performance if I break the logic down into a series of stages ( like ext, join, xfm, aggre, sort...etc )?
-Or let the Oracle handle all the processing? ( I am pulling data from one source only )

Thanks and Regards,
william
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

William,

there is no single good answer to that. If your SQL logic includes a big reduction in the amount of data then often it is faster to do it in the database and save I/O. Sometimes the DB machine is more heavily loaded than the DS or ETL machine, so processing can be faster inside DataStage.
kumar_s
Charter Member
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

It also depeds on where you database resides. If it is costly have carry huge number of rows from database server to DS server, it might be effecient to do the necessay aggregation steps from DS by either stroed procedure or using your SQL logic, and carry the necessay records for transformation to DS server.
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
DSguru2B
Charter Member
Charter Member
Posts: 6854
Joined: Wed Feb 09, 2005 3:44 pm
Location: Houston, TX

Post by DSguru2B »

Also, you need to always keep in mind, what time, or what part of day or night is your process going to run when it goes live. If during that time, not much database activity is taking place, then i would suggest doing aggregations sorting and joins at the database level.Transformations can be done in datastage.
My 2 centy.
Creativity is allowing yourself to make mistakes. Art is knowing which ones to keep.
Post Reply