What is the most efficient way to compare two datasets, apart from the diff and change capture operators in DataStage? Is there a Unix command that does this?
So what would you suggest: the diff operator, change capture, or something else?
Let me know.
Thanks.
Raj
Hi Raj,
I'm assuming you're trying to find an efficient way to capture source data changes. Based on the way you're asking, it doesn't seem like you need to detect changes at the field level. If that's the case, why not use a simple DataStage routine to do record-by-record comparisons? Simply write all the data you need compared to sequential files using standard delimiters.
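Outside of DataStage, the same record-by-record idea can be sketched in a few lines of Python. This is only an illustration, not a DataStage routine: it assumes both files are pipe-delimited text with records in the same order, and the function name `compare_records` and the `|` delimiter are my own choices.

```python
import csv
from itertools import zip_longest


def compare_records(path_a, path_b, delimiter="|"):
    """Return (record_number, row_a, row_b) for every record that differs.

    Assumes both files hold delimited text with records in the same order.
    A row of None means that file ran out of records first.
    """
    mismatches = []
    with open(path_a, newline="") as file_a, open(path_b, newline="") as file_b:
        reader_a = csv.reader(file_a, delimiter=delimiter)
        reader_b = csv.reader(file_b, delimiter=delimiter)
        # zip_longest keeps going until the longer file is exhausted,
        # so extra records at the end of either file are reported too.
        pairs = zip_longest(reader_a, reader_b)
        for number, (row_a, row_b) in enumerate(pairs, start=1):
            if row_a != row_b:
                mismatches.append((number, row_a, row_b))
    return mismatches
```

An empty list means the two files match. If the records can arrive in a different order, sort both files on the key first, otherwise every record after the first shift will show up as a mismatch.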
R. Michael Pickering
Senior Architect
Cohesion Systems Consulting Inc.
I think we need a little more detail in order to point you in the right direction. The diff command will work, but it is crude. There may be a better solution if we knew more about what you are trying to accomplish.
We want to do parallel testing on 20 different .ds files generated by DataStage version 6.0 and version 7.0 for the same batch day, so I need to make sure the data in the two sets of files is the same.
Please let me know if you have any more questions.
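For a parallel test like this, one possible approach is sketched below in Python. It rests on a few assumptions that are not from the thread: since a .ds file is only a descriptor for the data set, each data set is assumed to have been exported to a delimited text file first (for example by a small job that reads the data set and writes a sequential file), the 6.0 and 7.0 exports are assumed to sit in two directories under the same file names, and the names `compare_exports` and `record_counts` and the `*.txt` pattern are made up for illustration. Records are compared as whole lines and their order is ignored, because partitioning may not return rows in the same order in both versions.

```python
from collections import Counter
from pathlib import Path


def record_counts(path):
    """Count how many times each record (line) occurs in a file."""
    with open(path, newline="") as handle:
        return Counter(line.rstrip("\r\n") for line in handle)


def compare_exports(dir_old, dir_new, pattern="*.txt"):
    """Compare same-named export files in two directories, ignoring order.

    Returns a dict with one entry per file that does not match:
      name -> (records only in old, records only in new), or
      name -> None when the file exists in only one directory.
    """
    dir_old, dir_new = Path(dir_old), Path(dir_new)
    names = {p.name for p in dir_old.glob(pattern)}
    names |= {p.name for p in dir_new.glob(pattern)}

    report = {}
    for name in sorted(names):
        old_file, new_file = dir_old / name, dir_new / name
        if not (old_file.exists() and new_file.exists()):
            report[name] = None
            continue
        old_counts = record_counts(old_file)
        new_counts = record_counts(new_file)
        if old_counts != new_counts:
            # Counter subtraction keeps only the surplus on each side.
            report[name] = (old_counts - new_counts, new_counts - old_counts)
    return report
```

An empty result means all pairs of files hold the same records. Each file is read into memory, so for very large exports sorting both files and comparing them as streams may be a better fit.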