Different results on multiple nodes
Posted: Thu Oct 27, 2011 7:06 am
When I am running a job on single node the data and no.of records I am getting are perfect , but when running the same job on 4 (or Multi) nodes, it is acting weird. The no.of records I am getting is different(more than no.of records from source).
Have a join stage and filtering data with a constraint(column from right link of join) in transformer.
Source(CFF) -> join -> Xfrm
Both the input links to join stage are hash partitioned and are sorted on keys.
Could any one suggest me why is this behaving weird on multi nodes?
Have a join stage and filtering data with a constraint(column from right link of join) in transformer.
Source(CFF) -> join -> Xfrm
Both the input links to join stage are hash partitioned and are sorted on keys.
Could any one suggest me why is this behaving weird on multi nodes?