CPU resource utilization

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
senthilmp
Participant
Posts: 85
Joined: Mon Sep 22, 2008 6:11 am

CPU resource utilization

Post by senthilmp »

Hi,
Am facing some issue with performance of the job. The CPU resource on the server on which the target table database(Oracle) resides is not been well utilized by the Datastage job. i.e the Server on which the Database resided is alwasy 60% idle but still the Datastage job is not using the idle resources.

Is their any particular setting in datastage to request more CPU resources?

My job design is extract data from a table , some transformation is applied and update/insert is done to target table.

Can you pls help me out on getting more CPU resources?
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

No, there's no magic setting for that. You may, however, be able to make the job "multi-instance" and run multiple copies, each handling a portion of the data simultaneously.
-craig

"You can never have too many knives" -- Logan Nine Fingers
senthilmp
Participant
Posts: 85
Joined: Mon Sep 22, 2008 6:11 am

Post by senthilmp »

Ok Craig, Thanks
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

If the %CPU use on the DataStage system during job run is not above 80% then your job is not CPU bound, but most likely I/O bound.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Right, it's all about the job design and the resources you are leveraging - disk, database, etc.
-craig

"You can never have too many knives" -- Logan Nine Fingers
muruganr117
Participant
Posts: 40
Joined: Sun Jan 21, 2007 1:52 pm
Location: Chennai
Contact:

CPU% Utilization

Post by muruganr117 »

ArndW wrote:If the %CPU use on the DataStage system during job run is not above 80% then your job is not CPU bound, but most likely I/O bound. ...
Hello,

That was a wonderful piece of information i found after long search.
"Thanks ArndW!!!"
I would like dig further to know the execution process of Server jobs.

Scenario -
> I am working on DS7.1r2 Server , Unix
> Production server has 16physical CPU's,its Sun Solaris M8000 series
> I have a job running on 8 instances , there is no much transformation involved,
I/P HASH file(100fields) looks up from 39 hash files based on KEY and write to O/P seq file(200)length 5000byte per record
> Each instance handles nearly 500,000records

--Item to understand
Even if there are 16CPU's and CPU IDLE time is 60% ~ 70%
each instance is processing with less than 20% CPU as seen in director and process around 400~500 records per second.

I would like to know any method of increasing utilization of resources to improve performance. Any other suggestions with regard to design change of this job is also welcome.

Thank you very much for your time!!!

Really appreciate Chullet, ArndW efforts... :-)

regards
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

(I'm depressed, I wrote a long post and it has disappeared, and now I'm out of steam to re-write it...)

Don't touch your job or system until you know where your bottleneck is. You will need to measure and analyse what your system is doing before making any changes; otherwise you might cause everything to slow down instead of speeding up.

With the CPU load described it is clear that your system is not constrained by processing power at present. Memory is probably not an issue, since paging/swapping conditions usually will increase your %CPU significantly.

That leaves Disk I/O or, in the case of a SAN, network I/O. I don't know which flavour of UNIX you have to recommend a measuring tool, but the standard UNIX command iostat will give you a rough idea of I/O rates and distribution.
muruganr117
Participant
Posts: 40
Joined: Sun Jan 21, 2007 1:52 pm
Location: Chennai
Contact:

CPU% utilization

Post by muruganr117 »

Thank you very much ArndW

We are using Sun solaris 5.10(is this the UNIX flavor ?) , normally i do verify prstat, vmstat which i think more of CPU related, i shall understand & verify iostat and get back :)

Thank you very much once again!!!
ArndW wrote:(I'm depressed, I wrote a long post and it has disappeared, and now I'm out of steam to re-write it...)

Don't touch your job or system until you know where your bottleneck is. You will need to measure and analyse what your system is doing before making any changes; otherwise you might cause everything to slow down instead of speeding up.

With the CPU load described it is clear that your system is not constrained by processing power at present. Memory is probably not an issue, since paging/swapping conditions usually will increase your %CPU significantly.

That leaves Disk I/O or, in the case of a SAN, network I/O. I don't know which flavour of UNIX you have to recommend a measuring tool, but the standard UNIX command iostat will give you a rough idea of I/O rates and distribution.
Post Reply