Datastage Sort in conjunction with Oracle Sort

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
kool_cons
Participant
Posts: 68
Joined: Thu Jul 07, 2005 3:41 pm

Datastage Sort in conjunction with Oracle Sort

Post by kool_cons »

We are using Oracle read parallesim in datastage to run the same query on multiple nodes..Later we have to sort the data to remove dups.Is it better perform if we do Oracle Sort and Again Datastage Sort or just do datastage Sort? If the data is sorted in oracle before it goes to Datastage Sort,Is datastage clever enough to know that data is already sorted and will not take more time to Sort entire data again in datastage?

Thanks In Advance.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Sorting partially sorted data tends to be faster than sorting random data. You can insert a sort stage with "Don't sort, previously sorted" for the already sorted columns to let DS know that it doesn't need to sort data.
Post Reply