Hi,
I have two files both containing around 3mn records. I wanted to check out which option would be fastest for joining the data. So I used same set of files in different jobs on Join, Merge and Lookup but am getting different matched records. The Join and Merge show same set of matched records, but lookup shows a totally different number.
On furthere analysis the Lookup output is correct. I'm trying to figure out what is wrong with Join and Merge.
I have already ensured that the keys are hash partitoned and sorted.
What else could I be missing?
Join and Merge results different from Lookup
Moderators: chulett, rschirm, roy
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
HOW (on what columns) is the partitioning done? What partitioning are you using for the Lookup stage? Have you ensured that inputs to Join and Merge are correctly sorted?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact: