Page 1 of 1

Join Stage error

Posted: Mon Aug 02, 2010 11:18 am
by developeretl
Hi,
Iam trying to join the two datasets, one of the containing the 6 coumns and other containing 4 columns. Iam doing the inner join based on the key=col1 . In the output iam getting 0 rows, and iam getting error saying that columns have been droped out . This is the error "Join_267: When checking operator: Dropping component "Col2" because of a prior component with the same name. It for col3 and col4 . Can anyone help me why iam getting this error ???

Dataset 1 Dataset2
Cl1 Cl1
Cl2 Cl2
Cl3 Cl3
Cl4 Cl4
Cl5
Cl6

Posted: Mon Aug 02, 2010 11:23 am
by vinothkumar
Your non key columns to join stage is having same names in both links I guess. Try renaming the non key column names in one of the links and see.

Re: Join Stage error

Posted: Mon Aug 02, 2010 11:25 am
by creatingfusion
If you are doing Inner Join on the Col1 the output would have one record.
So try to remove the duplicates with key being Col1 and then sort the records using the Hash Partitioning, before you do the join ...
Probably on doing that you wont be getting this error.
Thanks
Abhijit

Posted: Mon Aug 02, 2010 2:16 pm
by ray.wurlod
The alert message is telling you that there can only be one field called col2 on the output, but you have one on each input. The message is informing you which one was used and which one was not used (dropped). If you want to preserve both, you need to name them differently on the inputs.

Make sure, too, that both inputs are partitioned and sorted on the Join keys.

Posted: Fri Aug 06, 2010 4:24 am
by srinivas.g
Try to enable the RCP in your output link.

It will resolve..........

Posted: Fri Aug 06, 2010 6:29 am
by ArndW
Turning on RCP doesn't change the behaviour that 2 columns with the same name exist but only one can be output.