Page 1 of 1

any strategy whether to go for parallel job or server job

Posted: Wed Jul 12, 2006 5:51 pm
by umamahes
is there any parameters to decide to go for parallel job or server job

Posted: Wed Jul 12, 2006 6:00 pm
by kumar_s
Welcome Aboard :D

Availabe resource, (money) and the amount of data to be process with in the batch window should be the main criteria you should lookup on.

Posted: Wed Jul 12, 2006 7:58 pm
by sri1dhar
My personal experience is Parallel jobs may perform better but they take longer to develop. There are several bugs & the issues just keep coming. Server Edition is much stable. Still we decided to stick with Parallel jobs.

Posted: Wed Jul 12, 2006 8:20 pm
by ray.wurlod
For single row jobs (such as select max(col) from table) a server job is probably faster, because its startup time is smaller.

Using parallelism techniques in server jobs can handle surprisingly large volumes of data. However, the fact that parallel jobs can scale automatically is a big plus.i

I'm still guided by gut feel rather than by quantified metrics. Even a large volume of data with a lot of date/time manipulation may give me reason to pause and at least consider using a server job. On the other hand, that's what I grew up with, so it's probably a biased view.

Budget would, of course, be a consideration also.

Posted: Thu Jul 13, 2006 6:27 pm
by vmcburney
Parallel jobs are faster on large volumes of data via all the parallel partitioning capabilities, they are also faster with sorting and aggregation functions even when compared in a non-parallel mode. The C++ stages of parallel jobs seem to be more efficient. However parallel jobs have a slower startup time (something they are trying to fix in Hawk) and they are fussier about metadata and when you first start using them it takes time getting used to all the warnings.

Server edition is cheaper and somewhat easier to use initially. However parallel is easier in some areas: very large data volumes, change data capture stage, more join/lookup/merge functionality and flexibility, easier to build custom stages.

Since server jobs and parallel jobs can run from the same job sequences it is quite easy to move from server to parallel by converting just those jobs that handle the highest volumes and leaving the bulk as server jobs.

You can read my blogs on the subject:
Process in parallel or take up folk dancing
DataStage server v enterprise: some performance stats
Hawk overview, screenshots and questionnaires!