Hi all,
I have a Remove Duplicates stage whose input link comes from a Copy stage. I hash partition on columns x, y, z in the Copy stage, and the Remove Duplicates stage uses Same partitioning with x, z, y as the key columns for removing duplicates.
Does the hash partition column order (x, y, z) need to be exactly identical to the Remove Duplicates key order (x, z, y)?
Thanks in advance,
Poova.
Remove duplicate
The order of the key definition is important to DataStage: sorting on columns A, B, C and then running Remove Duplicates on keys C, B, A will generate a warning message at runtime.
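To make the partitioning side of the question concrete, here is a minimal Python sketch. It is not DataStage code: `partition_of`, `hash_partition`, and `remove_duplicates` are hypothetical helpers, and the CRC32 hash is only a stand-in for whatever hash the engine actually uses. It illustrates one thing only: rows with identical x, y, z values always land in the same partition, so a per-partition de-duplication keyed on x, z, y still sees every duplicate together. It does not model the sort-order check or the runtime warning described above.

```python
import zlib
from collections import defaultdict


def partition_of(row, key_cols, n_parts):
    # Deterministic hash of the key values, taken in the given column order.
    # (CRC32 is an assumption for illustration, not the engine's real hash.)
    key_bytes = "|".join(str(row[c]) for c in key_cols).encode()
    return zlib.crc32(key_bytes) % n_parts


def hash_partition(rows, key_cols, n_parts):
    # Group rows by partition number.
    parts = defaultdict(list)
    for row in rows:
        parts[partition_of(row, key_cols, n_parts)].append(row)
    return parts


def remove_duplicates(rows, key_cols):
    # Sort on the key columns, then keep the first row of each run of equal keys.
    ordered = sorted(rows, key=lambda r: tuple(r[c] for c in key_cols))
    kept, previous = [], object()
    for row in ordered:
        key = tuple(row[c] for c in key_cols)
        if key != previous:
            kept.append(row)
            previous = key
    return kept


rows = [
    {"x": 1, "y": 2, "z": 3, "id": "a"},
    {"x": 1, "y": 2, "z": 3, "id": "b"},  # duplicate of "a" on x, y, z
    {"x": 3, "y": 2, "z": 1, "id": "c"},
    {"x": 1, "y": 3, "z": 2, "id": "d"},
]

# Partition on (x, y, z), then de-duplicate each partition on (x, z, y).
parts = hash_partition(rows, ["x", "y", "z"], n_parts=4)
deduped = [r for p in parts.values() for r in remove_duplicates(p, ["x", "z", "y"])]
```

In this model the duplicate pair "a"/"b" is always co-located, because both rows produce the same hash input. Keeping the column order consistent across the partition, sort, and Remove Duplicates key definitions is still the simplest way to avoid the warning mentioned in the reply.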