Hi,
I am trying to join two datasets, one containing 6 columns and the other containing 4 columns. I am doing an inner join on the key col1. In the output I am getting 0 rows, and I am getting an error saying that columns have been dropped. This is the error: "Join_267: When checking operator: Dropping component "Col2" because of a prior component with the same name." The same message appears for col3 and col4. Can anyone help me understand why I am getting this error?
Dataset 1    Dataset 2
Cl1          Cl1
Cl2          Cl2
Cl3          Cl3
Cl4          Cl4
Cl5
Cl6
Join Stage error
Re: Join Stage error
If you are doing an inner join on Col1, the output would have one record.
So try removing the duplicates on the key Col1, then hash partition and sort the records on that key before you do the join.
Doing that will probably get rid of this error.
Thanks
Abhijit
-
The alert message is telling you that there can only be one field called col2 on the output, but you have one on each input. The message is informing you which one was used and which one was not used (dropped). If you want to preserve both, you need to name them differently on the inputs.
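To illustrate the point about name clashes, here is a small Python sketch. It is not DataStage code, only an analogy: `inner_join` and `rename` are hypothetical helpers, the row values are made up, and keeping the first input's value on a clash is an assumption based on the wording "prior component" in the message.

```python
def inner_join(left, right, key):
    """Inner join two lists of dict rows on `key`.

    Assumption: when both inputs carry a non-key column with the same
    name, the value from the first (left) input is kept and the other
    one is dropped, since a row can hold only one column per name.
    """
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)

    joined = []
    for left_row in left:
        for right_row in index.get(left_row[key], []):
            merged = dict(right_row)
            merged.update(left_row)  # left values win on name clashes
            joined.append(merged)
    return joined


def rename(rows, mapping):
    """Return copies of the rows with columns renamed per `mapping`."""
    return [{mapping.get(name, name): value for name, value in row.items()}
            for row in rows]


# Made-up sample rows: 6 columns on one input, 4 on the other.
dataset1 = [{"Col1": 1, "Col2": "a", "Col3": "b",
             "Col4": "c", "Col5": "d", "Col6": "e"}]
dataset2 = [{"Col1": 1, "Col2": "x", "Col3": "y", "Col4": "z"}]

# Same names on both inputs: Col2, Col3, Col4 from dataset2 are lost.
clashing = inner_join(dataset1, dataset2, "Col1")

# Renaming on one input first preserves both sets of values.
renamed = inner_join(
    dataset1,
    rename(dataset2, {"Col2": "Col2_ds2", "Col3": "Col3_ds2", "Col4": "Col4_ds2"}),
    "Col1",
)
```

In the first call the result has only 6 columns, in the second it has 9. In a job, the equivalent of `rename` would be changing the column names on one of the input links before the Join stage.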
Make sure, too, that both inputs are partitioned and sorted on the Join keys.
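As a rough picture of why the partitioning matters, the following Python sketch hash partitions two inputs on the join key and sorts each partition. It is only an analogy for what the engine does, not DataStage code; `hash_partition`, the partition count, and the sample keys are all assumptions made for illustration.

```python
def hash_partition(rows, key, n_partitions):
    """Split rows into partitions by hashing the join key, then sort each one.

    Rows with equal key values always land in the same partition number,
    so a join that runs partition by partition can still find every match.
    """
    partitions = [[] for _ in range(n_partitions)]
    for row in rows:
        partitions[hash(row[key]) % n_partitions].append(row)
    for part in partitions:
        part.sort(key=lambda r: r[key])
    return partitions


# Made-up integer keys for the two inputs.
left_rows = [{"Col1": k} for k in (5, 3, 8, 1)]
right_rows = [{"Col1": k} for k in (8, 5, 2)]

left_parts = hash_partition(left_rows, "Col1", 2)
right_parts = hash_partition(right_rows, "Col1", 2)

# Keys 5 and 8 exist on both inputs and sit in the same partition
# number on each side, so they can be matched locally.
for number, (lpart, rpart) in enumerate(zip(left_parts, right_parts)):
    print(number, [r["Col1"] for r in lpart], [r["Col1"] for r in rpart])
```

If the two inputs were partitioned differently (for example, one by hash and one round robin), matching keys could end up in different partitions and the join would silently miss them.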
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
-
Turning on RCP (runtime column propagation) doesn't change this behaviour: when two columns with the same name exist, only one can be output.