join stage issue

Posted: Wed Jul 29, 2009 12:40 am
by dnat
Hi

I am joining two sources (a file and a table) on one key. If there is no match, I then join the non-matched records on a different field of the table.

I am just using Auto-partition throughout the job.

The first join works properly, but the second join does not. If it were a partitioning problem, it should affect both joins, shouldn't it?

Posted: Wed Jul 29, 2009 1:13 am
by mandyli
Hi,
How are you collecting the non-matching records out of a join? Is there any provision in the Join stage to collect them? Please be clearer.

Thanks.

Posted: Wed Jul 29, 2009 1:17 am
by ArndW
Try running your job with a 1-node configuration and then with a 2-node configuration. If the results are different, then you have a partitioning problem that you need to correct.
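To illustrate why the node count matters, here is a minimal Python sketch (not DataStage code) that simulates a partitioned join. The row layout, the `partition` and `partitioned_join` helpers, and the sample values are all hypothetical. It assumes both inputs are still distributed on the first join's key (`id`) while the second join compares a different column (`alt`), which may or may not be what Auto partitioning does in your job.

```python
def partition(rows, n_parts, key):
    """Distribute rows into n_parts buckets by (key value modulo n_parts)."""
    parts = [[] for _ in range(n_parts)]
    for row in rows:
        parts[row[key] % n_parts].append(row)
    return parts


def partitioned_join(left, right, n_parts, partition_key, join_key):
    """Inner-join left and right on join_key, but only within each partition."""
    matches = []
    left_parts = partition(left, n_parts, partition_key)
    right_parts = partition(right, n_parts, partition_key)
    for lpart, rpart in zip(left_parts, right_parts):
        # Build a lookup of the right-hand rows that live on this "node".
        lookup = {}
        for r in rpart:
            lookup.setdefault(r[join_key], []).append(r)
        for l in lpart:
            for r in lookup.get(l[join_key], []):
                matches.append((l["id"], r["id"]))
    return sorted(matches)


# Hypothetical sample data: integer keys keep the modulo partitioning simple.
left = [{"id": 1, "alt": 10}, {"id": 2, "alt": 21}]
right = [{"id": 4, "alt": 10}, {"id": 3, "alt": 21}]

one_node = partitioned_join(left, right, 1, "id", "alt")
two_nodes_wrong_key = partitioned_join(left, right, 2, "id", "alt")
two_nodes_join_key = partitioned_join(left, right, 2, "alt", "alt")

print(one_node)             # every row is on the same node, so all pairs are found
print(two_nodes_wrong_key)  # rows with equal "alt" sit on different nodes
print(two_nodes_join_key)   # partitioning on the join key restores the matches
```

In this sketch the 1-node run finds both pairs, the 2-node run partitioned on `id` finds none, and the 2-node run partitioned on `alt` finds both again. A difference of that kind between the two configurations is the symptom to look for.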

Posted: Wed Jul 29, 2009 6:26 pm
by keshav0307
Have you checked the key checkbox?
Can you use hash partitioning?

Posted: Wed Jul 29, 2009 7:15 pm
by ddevdutt
The keys you partition on in the first join will be different from the keys you partition on in the second join.

What are the various stages in your job?

Posted: Fri Jul 31, 2009 1:27 am
by dnat
I set the second join to run sequentially, but some of the records are still not joined properly.

Posted: Fri Jul 31, 2009 2:17 am
by Sainath.Srinivasan
Did you check the datatype, length, etc. of the joining columns?

Is it a case-sensitive match?

What do the records which do not join have in common? Try to locate the keys of missing records in the counterpart source manually.
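One way to do that manual check outside the job is sketched below in Python. It assumes you have exported the key column from each source into a list; the sample keys and the helper names (`unmatched_keys`, `describe`, `near_matches`) are made up for illustration. Printing `repr()` and the length of each unmatched key tends to expose trailing spaces or case differences that look identical on screen.

```python
def unmatched_keys(left_keys, right_keys):
    """Return the left keys that have no exact match on the right."""
    right = set(right_keys)
    return [k for k in left_keys if k not in right]


def describe(key):
    """Show the raw form and length of a key so hidden characters stand out."""
    return {"repr": repr(key), "length": len(key)}


def near_matches(key, candidates):
    """Find candidates equal to key once whitespace and case are ignored."""
    norm = key.strip().casefold()
    return [c for c in candidates if c.strip().casefold() == norm]


# Hypothetical exports of the join column from the file and the table.
file_keys = ["ABC123", "XYZ789 ", "def456"]
table_keys = ["ABC123", "XYZ789", "DEF456"]

missing = unmatched_keys(file_keys, table_keys)
for key in missing:
    print(describe(key), "near matches:", near_matches(key, table_keys))
```

If an unmatched key has a near match, the difference between the two `repr()` values usually tells you whether the problem is padding, case, or something else.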

Posted: Fri Jul 31, 2009 3:32 am
by dnat
It is a varchar field of length 13 on both links. When I query it with SQL I get a result, but when the join happens it doesn't match. It is a case-insensitive match only.

Posted: Sun Aug 02, 2009 8:44 am
by Sreenivasulu
Some control characters might be appended to the data in the database.
Use rtrim and ltrim on the data in the database.

Then run the job.

Regards
Sreeni
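As a rough illustration of the control-character idea, the Python sketch below (not DataStage or SQL) shows a key that still differs after a plain trim because it ends in a NUL character. The `clean_key` helper and the sample values are hypothetical. Depending on the database, rtrim and ltrim may remove only spaces, so it may be worth checking for other characters as well.

```python
import unicodedata


def clean_key(value):
    """Drop control characters (Unicode category Cc), then trim whitespace."""
    without_controls = "".join(
        ch for ch in value if unicodedata.category(ch) != "Cc"
    )
    return without_controls.strip()


# Hypothetical key values as they might arrive from the two sources.
from_file = "ABC123"
from_table = "ABC123\x00"

print(from_file == from_table)                        # raw values differ
print(from_file == from_table.strip())                # a plain trim keeps the NUL
print(clean_key(from_file) == clean_key(from_table))  # cleaned values agree
```

Note that this helper removes control characters anywhere in the value, not only at the ends, so it suits keys that should never contain them.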