i have 2 sequential files

dssubhani · Post by **dssubhani** » Fri Dec 17, 2010 7:24 pm

i have 2 sequential files in which
one file having firstname,lastname

2nd file having firstname,e-mailaddress

now i want output as
firstname,lastname,e-mail-address...........how?

i used join stage join type as inner it does n't work...

stuartjvnorton · Post by **stuartjvnorton** » Fri Dec 17, 2010 7:45 pm

You tell us. You've given us practically nothing to go on. Unless you can get most or all of the last name out of the email address, you won't have enough for a reliable match.
One thing I do know is that an inner join is not the way to go about it.

abhijitg · Post by **abhijitg** » Sun Dec 19, 2010 8:36 pm

Hi,

Make sure you sort the incoming records on the key prior to the join and it should work. Your problem might be the selection of a bad key column rather the join. The first name might not uniquely identify a row which might result in cross joins. Secondly by selecting an inner join you might be dropping records (not everyone has an email address).

Thanks
Abhijit

abhijitg · Post by **abhijitg** » Sun Dec 19, 2010 8:43 pm

Hi,

Using a lookup stage might be more appropriate, with the fname,lname file as your input and the fname, email file as you reference link. The bad choice for the key still holds.

Thanks
Abhijit

asyed · Post by **asyed** » Sun Dec 19, 2010 8:44 pm

abhijitg wrote:Hi,

Make sure you sort the incoming records on the key prior to the join and it should work. Your problem might be the selection of a bad key column rather the join. The first name might not uniquely identify a row which might result in cross joins. Secondly by selecting an inner join you might be dropping records (not everyone has an email address).

Thanks
Abhijit

In addtion to the above

a) Hash Partition the input
b) Maintain consistency in the Input key fields (ie. Trim, Upper/Lower Case).