Sort stage with duplicate values in input

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
aaikat
Participant
Posts: 47
Joined: Tue Mar 07, 2006 2:49 am

Sort stage with duplicate values in input

Post by aaikat »

I have a sequential file as Sort stage input containing data like :

empid
1
2
3
1
2
4
4

Now when I use the Sort stage on this input file with ascending empid (Allow Duplicate=True,all other options default), I get the following output

empid
3
1
1
2
2
4
4

which shows repeating values are sorted but non-repeating value('3') comes in wrong position.

Please explain this behaviour.
balajisr
Charter Member
Charter Member
Posts: 785
Joined: Thu Jul 28, 2005 8:58 am

Post by balajisr »

what is your target?
aaikat
Participant
Posts: 47
Joined: Tue Mar 07, 2006 2:49 am

Post by aaikat »

My target is a data set
balajisr
Charter Member
Charter Member
Posts: 785
Joined: Thu Jul 28, 2005 8:58 am

Post by balajisr »

Dataset displays data from more than one partitions and the data is sorted within partitions in dataset.

Write the data to the sequential file and find out if there are any differences.
aaikat
Participant
Posts: 47
Joined: Tue Mar 07, 2006 2:49 am

Post by aaikat »

yes...It is appearing correctly in sequential file..thanks
Post Reply