Remove Duplicate Problem when using 2X

Posted: Mon Nov 15, 2010 12:52 pm
by _chamak
I have a job that currently runs fine with 1X but has a problem when I change it to 2X. It has a Sort followed by Remove Duplicates. The data is hash partitioned in the Sort based on the key columns; the data type for both of them is integer. With 1X, Remove Duplicates removes 148 rows, but with 2X it removes only 147. Can anyone help me with this?

Posted: Mon Nov 15, 2010 2:12 pm
by ray.wurlod
Welcome aboard.

Are the data partitioned on the "key used to determine duplicates"?

Posted: Mon Nov 15, 2010 4:00 pm
by _chamak
ray.wurlod wrote: Welcome aboard.

Are the data partitioned on the "key used to determine duplicates"?
I am partitioning on the key columns in the Sort before Remove Duplicates.

Posted: Mon Nov 15, 2010 11:01 pm
by ray.wurlod
Are the sort keys the same as the remove duplicate keys?
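
To show why that question matters, here is a minimal Python sketch of one way the row counts could diverge. It is a simulation only, not DataStage code, and the thread does not confirm that this is the cause. The column names `a` and `b`, the helper functions, and the `sum(key) % n_parts` partitioner (a deterministic stand-in for a real hash) are all assumptions made for illustration. The idea: if the data is partitioned on more columns than the duplicate key, rows that share the duplicate key can land in different partitions, and each partition then keeps its own copy.

```python
def partition(rows, key_cols, n_parts):
    """Split rows into n_parts lists by key (stand-in for hash partitioning)."""
    parts = [[] for _ in range(n_parts)]
    for row in rows:
        key = tuple(row[c] for c in key_cols)
        # Hypothetical partitioner: sum of the integer key values modulo n_parts.
        parts[sum(key) % n_parts].append(row)
    return parts


def remove_duplicates(rows, key_cols):
    """Sort one partition on key_cols and keep the first row per key value."""
    kept = []
    last_key = None
    for row in sorted(rows, key=lambda r: tuple(r[c] for c in key_cols)):
        key = tuple(row[c] for c in key_cols)
        if key != last_key:
            kept.append(row)
            last_key = key
    return kept


def run(rows, partition_cols, dedup_cols, n_parts):
    """Partition, de-duplicate each partition independently, then collect."""
    out = []
    for part in partition(rows, partition_cols, n_parts):
        out.extend(remove_duplicates(part, dedup_cols))
    return out


# Made-up sample data: duplicates are defined on column "a" only.
rows = [
    {"a": 1, "b": 1},
    {"a": 1, "b": 2},
    {"a": 2, "b": 1},
    {"a": 2, "b": 2},
]

one_part = run(rows, ["a", "b"], ["a"], 1)       # single partition
two_parts_wide = run(rows, ["a", "b"], ["a"], 2)  # partitioned on a and b
two_parts_same = run(rows, ["a"], ["a"], 2)       # partitioned on a only

print(len(one_part), len(two_parts_wide), len(two_parts_same))
```

With a single partition every duplicate meets its match, so the partition key does not matter. With two partitions, only the run that partitions on exactly the duplicate key (or a subset of it) gives the same result as the single-partition run.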