Page 1 of 1

User inserted sort...

Posted: Wed Apr 09, 2008 4:31 am
by prematram
I used a sort stage with hash partionning and after sorting the data , i used remove duplicates stage with same partition.

I am encountering the error as
"User inserted sort "LabResultMem_srt" does not fulfill the sort requirements of the downstream operator "LabResultMem_rdup"

Note:

I have not got the warning when i tried to run this job under 4 partioned nodes, But when i tried running the job with 8 node partitions , i am getting this error.


I searched under this topic,but i have not got answers in relation with no of partitions.is there any problem with no of partitions...?

No of records processed - 300 million records.

Re: User inserted sort...

Posted: Wed Apr 09, 2008 5:26 am
by swapnilverma
prematram wrote:I used a sort stage with hash partionning and after sorting the data , i used remove duplicates stage with same partition.

I am encountering the error as
"User inserted sort "LabResultMem_srt" does not fulfill the sort requirements of the downstream operator "LabResultMem_rdup"

Note:

I have not got the warning when i tried to run this job under 4 partioned nodes, But when i tried running the job with 8 node partitions , i am getting this error.


I searched under this topic,but i have not got answers in relation with no of partitions.is there any problem with no of partitions...?

No of records processed - 300 million records.

First of all why you need to use remove duplicate stage .. you can remove duplicates in sort stage it self ( by setting Allow Duplicates to FALSE)

2nd if you have used join and merge stage Set the environment variable APT_NO_SORT_INSERTION to true ...

Hope it will work !!!

Still having one doubt

Posted: Wed Apr 09, 2008 7:11 am
by prematram
Hi,

Thanks for your response.

But same job when i tried running with config referring 4 nodes, it hasa run successfully. But the problem araised only when we tried running with config file pointing to 8 nodes.

The prevailing problem is anyway related to that...??

Re: Still having one doubt

Posted: Wed Apr 09, 2008 9:44 am
by swapnilverma
prematram wrote:Hi,

Thanks for your response.

But same job when i tried running with config referring 4 nodes, it hasa run successfully. But the problem araised only when we tried running with config file pointing to 8 nodes.

The prevailing problem is anyway related to that...??

Well.. the above warning is coming for which stage...
also provide your design in detail?

Posted: Wed Apr 09, 2008 7:30 pm
by jhmckeever
I know it's obvious but you ARE sorting and de-duping on the same keys, aren't you?

Posted: Wed Apr 09, 2008 9:10 pm
by ray.wurlod
What partitioning is specified on the input (a) of the Sort stage and (b) of the Remove Duplicates stage? Are there any stages between these two?

What are your specified Sort keys? What are your specified Remove Duplicate keys?