Difference stage problem

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
splayer
Charter Member
Charter Member
Posts: 502
Joined: Mon Apr 12, 2004 5:01 pm

Difference stage problem

Post by splayer »

I am using the difference stage to find the difference between 2 files, FileA and FileB. I copied FileA into FileB. However, the difference stage still finds most of the rows different between the files. The diff column returns these values in the output dataset:

0
1
2
2
0
0
1
1
2

What's interesting is, the 2 files have only 6 rows. The 3rd column is a numeric column and was the key column. All columns are char type.
nick.bond
Charter Member
Charter Member
Posts: 230
Joined: Thu Jan 15, 2004 12:00 pm
Location: London

Post by nick.bond »

have you checked the files are the same using unix

Code: Select all

 diff file1 file2 
is your partitioning the same for the 2 inputs to the stage?
Regards,

Nick.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

If there's only six records in each file, why not post the contents of each?
-craig

"You can never have too many knives" -- Logan Nine Fingers
Post Reply