Need to develop parallel job

mydsworld · Post by **mydsworld** » Thu Feb 08, 2007 11:54 pm

Since we can run a server job on both single-processor and multi-processor systems, why should we go for a parallel job? Just because it gives more stages to work with?

ray.wurlod · Post by **ray.wurlod** » Fri Feb 09, 2007 12:00 am

No reason.

Parallel jobs do give you the future flexibility to spread your processing over multiple CPUs in multiple machines (e.g. in a cluster or grid configuration) and to be able to change the number of partitions without needing to recompile or provide different parameter values, which you would need to do if using server jobs.

And to take advantage of many of the new features in IBM Information Server, you must be running on the parallel architecture.

But if you're happy with what you're doing, and it's performing adequately, then it's perfectly OK to stay there. Server jobs will be supported by IBM for a very long time yet.

kumar_s · Post by **kumar_s** » Fri Feb 09, 2007 12:44 am

Ray, isn't the Orchestrate engine faster than the DsEngine (Server)? Let's just talk about a single CPU and the same transformation logic.

ray.wurlod · Post by **ray.wurlod** » Fri Feb 09, 2007 2:19 am

For small jobs server jobs are faster (finish more quickly). The startup cost of parallel jobs (even just starting the conductor process, composing the score, starting the section leader processes, distributing the score and starting the player processes - oh, and license checking) is an overhead that server jobs don't have.

I have no quantified results about where the break-even point would be, since this would be hardware-specific in any case. However, as a rule of thumb with a local database I'd opt for a server job for anything up to 1000 rows.
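The trade-off above can be sketched as a simple model: total run time is a fixed startup cost plus rows divided by throughput. The Python below is only an illustration of that arithmetic. The function names and every number in the usage example are hypothetical, not measurements from any DataStage installation.

```python
def run_time(startup_s, rows, rows_per_s):
    """Estimated elapsed seconds: fixed startup cost plus per-row processing time."""
    return startup_s + rows / rows_per_s


def break_even_rows(startup_a, rate_a, startup_b, rate_b):
    """Row count at which job B (slower to start, faster per row) matches job A.

    Returns None if B never catches up, and 0.0 if B is never behind.
    """
    if startup_b <= startup_a:
        return 0.0 if rate_b >= rate_a else None
    if rate_b <= rate_a:
        return None
    # Solve startup_a + n / rate_a == startup_b + n / rate_b for n.
    return (startup_b - startup_a) / (1.0 / rate_a - 1.0 / rate_b)


if __name__ == "__main__":
    # Hypothetical figures only: a "server" job with 1 s startup at 1000 rows/s
    # versus a "parallel" job with 10 s startup at 4000 rows/s.
    n = break_even_rows(1.0, 1000.0, 10.0, 4000.0)
    print(f"break-even at about {n:.0f} rows")
```

With real timings for both job types on your own hardware substituted in, the same calculation gives a site-specific break-even point rather than a rule of thumb.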

vmcburney · Post by **vmcburney** » Sun Feb 11, 2007 10:54 pm

I've got two blog posts showing how a parallel job can be a lot faster than a server job: "DataStage Tip: Extracting database data 250% faster", just posted, which reports on a developerWorks article, and "DataStage server v enterprise: some performance stats". You can still get great performance out of server jobs with techniques such as Unix sorting, CRC32, hash files and multiple instance jobs.
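Assuming CRC32 here refers to the common change-detection pattern (store a checksum per row, then reprocess only rows whose checksum differs), the idea can be sketched in Python as below. This is not DataStage code; `row_crc`, `changed_keys` and the sample data are hypothetical names chosen for illustration.

```python
import zlib


def row_crc(row):
    """CRC32 of a row's values joined with a delimiter.

    Assumes the delimiter does not occur in the data; otherwise two
    different rows could produce the same joined string.
    """
    joined = "|".join(str(value) for value in row)
    return zlib.crc32(joined.encode("utf-8"))


def changed_keys(previous_crcs, current_rows):
    """Keys that are new, or whose row checksum differs from the stored one.

    previous_crcs: dict mapping key -> CRC32 saved from the last run
    current_rows:  dict mapping key -> tuple of column values
    """
    return [
        key
        for key, row in current_rows.items()
        if previous_crcs.get(key) != row_crc(row)
    ]


if __name__ == "__main__":
    stored = {1: row_crc(("alice", "NY")), 2: row_crc(("bob", "LA"))}
    incoming = {1: ("alice", "NY"), 2: ("bob", "SF"), 3: ("carol", "TX")}
    print(changed_keys(stored, incoming))  # keys 2 (changed) and 3 (new)
```

Comparing one integer per row instead of every column is what makes this cheap, at the cost of a small chance that a changed row collides with its old checksum.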