Parallel Vs Server Performance Test - Unexpected Results
Moderators: chulett, rschirm, roy
In link sort in the target stage could be one reason. And when both the source and target are stages that run only in sequential mode, the idea of parallelism is lost. And more over the startup times are always more for parallel jobs and increase with the number of stages and number of node as apparent from your experience. But your run times show an abnormal (to me) increase in times. Probably look at the job log to see the start up times and run times and compare them. Try with a DataSet as source and target in the parallel job.
In link sort in the target stage could be one reason. And when both the source and target are stages that run only in sequential mode, the idea of parallelism is lost. And more over the startup times are always more for parallel jobs and increase with the number of stages and number of node as apparent from your experience. But your run times show an abnormal (to me) increase in times. Probably look at the job log to see the start up times and run times and compare them. Try with a DataSet as source and target in the parallel job.
Aggt stage is working in Parallel execution mode, Partition type is 'SAME'.
I tried with removing 'Perform sort' option in target sequential file stage and Used sorted merge collection method but not getting sorted o/p.
Server load is zero during tests.
I tried with removing 'Perform sort' option in target sequential file stage and Used sorted merge collection method but not getting sorted o/p.
Server load is zero during tests.
I never let school to interfere in my education
-
- Participant
- Posts: 3593
- Joined: Thu Jan 23, 2003 5:25 pm
- Location: Australia, Melbourne
- Contact:
What you are seeing is the overheads of partitioning and re-partitioning your data, which is why four nodes takes longer than two. I am surprised a single node is slower than a server job - my own benchmark shows parallel jobs sort and aggregate many times faster than server jobs. During the parallel sort it will write some data out to temporary files, I think it's in the temp directory, and it looks like you have inefficient file i/o. If you are on version 8 I would switch on some of the job monitoring features to see what is slowing your jobs down.
Certus Solutions
Blog: Tooling Around in the InfoSphere
Twitter: @vmcburney
LinkedIn:Vincent McBurney LinkedIn
Blog: Tooling Around in the InfoSphere
Twitter: @vmcburney
LinkedIn:Vincent McBurney LinkedIn