Sort stage warnings

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
kollurianu
Premium Member
Premium Member
Posts: 614
Joined: Fri Feb 06, 2004 3:59 pm

Sort stage warnings

Post by kollurianu »

Hi All,

I am getting following warnings in sort stage,

sort_time_diff: Data claims to already be sorted on the specified keys the 'sorted' option can be used to confirm this. Data will be resorted as necessary. Performance may improve if this sort is removed from the flow

Any thoughts why I am getting above warnings can some shed light?

All I am trying to do is sorting that stage and in the next stage remove duplicates.

appreciate your help in advance.

Thank you all.
divesh
Participant
Posts: 7
Joined: Wed Dec 20, 2006 5:18 pm

Re: Sort stage warnings

Post by divesh »

Take out the sort stage and do the hash partition and perform the presort on the Input tab in remove duplicate stage.
kollurianu
Premium Member
Premium Member
Posts: 614
Joined: Fri Feb 06, 2004 3:59 pm

Post by kollurianu »

I did that still remove duplicates is giving a warning as below

rmv_dup: When checking operator: User inserted sort "rmv_dup.lnk_to_rem_dup_Sort" does not fulfill the sort requirements of the downstream operator "rmv_dup"
kwwilliams
Participant
Posts: 437
Joined: Fri Oct 21, 2005 10:00 pm

Post by kwwilliams »

Two things. Its appears that you are sorting your data multiple times, that is why you are getting this warning:

sort_time_diff: Data claims to already be sorted on the specified keys the 'sorted' option can be used to confirm this. Data will be resorted as necessary. Performance may improve if this sort is removed from the flow

It is saying you've already done this once before, you don't need to do this again. If you would like you can change the sort method to "don't sort previously sorted".

On your remove duplicate stage, your key columns must be listed int eh same order in the remove duplicate stage as they are in the sort stage (or link) and the options for the sort and remove duplicates need to be the same (Case Sensitive and EBCDIC). If the order or options are different this warning will appear because the sort and remove duplicate options are based upon two different algorithims.
Post Reply