My requirement was to remove the duplicate records from the source(Sequential file).So i was taking a sequential file-->Remove duplicate stage-->sequential file.
There are two columns say col1,col2.I gave duplicate to retain option as --'first'.
When i gave partition type option as 'same', the duplicates are being removed and the output was also sorted based on the 'col1'.Now when i give partition type as 'random' or 'range' ...the duplicate records are not being removed and instead all the records are coming...
Why was it happening like that...
![Sad :(](./images/smilies/icon_sad.gif)