Problem with Joiner stage
Posted: Tue Feb 12, 2008 12:46 am
Hi All,
I have a scenario where I am joining 2 datasets in a Join stage. The dataset A.ds has 479 records and dataset B.ds has 383 records. I am doing an inner join, which I expect to give 479 records in the output.
I also have a Sort stage before the Join stage. I am sorting on the same key that I use for the join. I tried these things:
1. I hash partitioned both inputs on the same key that I sort/join on, which gave me 518 records as output.
2. I removed the partitioning and just sorted the data before joining, which gave me 14 records as output.
3. I used a Lookup stage just for testing and it gave me 479 records, which is correct.
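To make the row counts I am comparing concrete, here is a minimal Python sketch of how I understand the two operations in general. It is not DataStage code, and the keys and values are made up for illustration (they are not my actual data). The functions `inner_join` and `lookup` are hypothetical helpers: the first emits one row per matching pair of keys, the second emits at most one row per input row, assuming the lookup returns only the first reference match.

```python
from collections import defaultdict


def inner_join(left, right):
    """Emit one output row for every matching (left, right) pair on the key."""
    index = defaultdict(list)
    for key, value in right:
        index[key].append(value)
    return [(key, lv, rv) for key, lv in left for rv in index.get(key, [])]


def lookup(left, right):
    """Emit at most one row per left row, using the first reference match only."""
    first = {}
    for key, value in right:
        first.setdefault(key, value)  # keep the first value seen for each key
    return [(key, lv, first[key]) for key, lv in left if key in first]


# Made-up sample data: key 2 appears twice on the right-hand input.
left = [(1, "a1"), (2, "a2"), (3, "a3")]
right = [(1, "b1"), (2, "b2"), (2, "b2-dup"), (3, "b3")]

joined = inner_join(left, right)
looked_up = lookup(left, right)
print(len(joined), len(looked_up))
```

In this sketch the two counts only agree when every key on the left matches exactly one row on the right, so a repeated key is one way a join and a lookup could report different totals.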
I am not able to understand what is wrong when I use the Join stage. If someone has an idea about this, please let me know.
Thanks in advance
Shiva