Hi all,
When i am trying to remove duplicates from the iput link of the remove duplicate stage, it is giving an warning as,
Remove_Duplicates_367: When checking operator: User inserted sort "Remove_Duplicates_367.DSLink358_Sort" does not fulfill the sort requirements of the downstream operator "Remove_Duplicates_367"
I was using Hash partitioning and enabled Sorting option as 'Perform Sort'
What might be the possible problem...
Remove Duplicate Warning
Moderators: chulett, rschirm, roy
Remove Duplicate Warning
Thanks and Regards!!
dspxlearn
dspxlearn
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
The reason that partitioning must be the same is to guarantee that any duplicates occur on the same processing node.
The reason that sorting should occur is so that least memory can be consumed - once the sort key value changes, the stage can be certain that there will be no more matches against this value and can quickly discard any rows from the other input that share this key value.
The reason that sorting should occur is so that least memory can be consumed - once the sort key value changes, the stage can be certain that there will be no more matches against this value and can quickly discard any rows from the other input that share this key value.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.