Page 1 of 1

i have 2 sequential files

Posted: Fri Dec 17, 2010 7:24 pm
by dssubhani
i have 2 sequential files in which
one file having firstname,lastname

2nd file having firstname,e-mailaddress

now i want output as
firstname,lastname,e-mail-address...........how?

i used join stage join type as inner it does n't work...

Posted: Fri Dec 17, 2010 7:45 pm
by stuartjvnorton
You tell us. You've given us practically nothing to go on. Unless you can get most or all of the last name out of the email address, you won't have enough for a reliable match.
One thing I do know is that an inner join is not the way to go about it.

Posted: Sun Dec 19, 2010 8:36 pm
by abhijitg
Hi,

Make sure you sort the incoming records on the key prior to the join and it should work. Your problem might be the selection of a bad key column rather the join. The first name might not uniquely identify a row which might result in cross joins. Secondly by selecting an inner join you might be dropping records (not everyone has an email address).

Thanks
Abhijit

Posted: Sun Dec 19, 2010 8:43 pm
by abhijitg
Hi,

Using a lookup stage might be more appropriate, with the fname,lname file as your input and the fname, email file as you reference link. The bad choice for the key still holds.

Thanks
Abhijit

Posted: Sun Dec 19, 2010 8:44 pm
by asyed
abhijitg wrote:Hi,

Make sure you sort the incoming records on the key prior to the join and it should work. Your problem might be the selection of a bad key column rather the join. The first name might not uniquely identify a row which might result in cross joins. Secondly by selecting an inner join you might be dropping records (not everyone has an email address).

Thanks
Abhijit
In addtion to the above

a) Hash Partition the input
b) Maintain consistency in the Input key fields (ie. Trim, Upper/Lower Case).