Job compilation takes about 4 hrs

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

ethelvina
Participant
Posts: 17
Joined: Sun Mar 30, 2008 10:05 pm

Job compilation takes about 4 hrs

Post by ethelvina »

Job compilation takes about 4 hrs, is there any way to reduce the compilation time?
There are 1145 columns in the input file and all the columns are used for transformation.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Historically the transform stages have been a cause for delayed compile times, but nothing on the order of 4 hours. Some process must be busy during this extended period, so please check the following:

1. How long do other jobs take to compile?
2. While compiling, check your PC's I/O and CPU usage
3. While compiling, check the DataStage Server's CPU usage.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

It always takes four hours or has on occassion taken that long?
-craig

"You can never have too many knives" -- Logan Nine Fingers
datisaq
Participant
Posts: 154
Joined: Wed May 14, 2008 4:34 am

Post by datisaq »

Hi Arndw,
Can you please tell how to check the CPU usage and I/O usage at compile time?


Regards
datisaq
IBM Certified - Information Server 8.1
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

datisaq - I use the windows taskmgr to see the client PC usage and have sometimes seen the local CPU and particularly the I/O go up significantly during compiles, this is because parts of the compilation are actually done on the PC and not on the server and much data can be passed between the two, hence looking at I/O.
CPU usage on a UNIX server is determined by seeing if a compile process (identified via "ps -ef") is using a lot of CPU for a long period of time. I also use vmstat (most UNIX implementations have their own proprietary tool for detailed measurement which I'll use if I can, but vmstat and iostat are common across all UNIX flavors and give a good overview).
If any compile uses a lot of resources for 4 hours then something is seriously wrong; if the compile takes 4 hours and consumes little or no resources, then something is also seriously wrong. Either way, one needs to collect some statistics first.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Any chance of answering my question? Wondering if this is a single-use-license-queueing issue. :?
-craig

"You can never have too many knives" -- Logan Nine Fingers
agpt
Participant
Posts: 151
Joined: Sun May 16, 2010 12:53 am

Post by agpt »

ArndW wrote:datisaq - I use the windows taskmgr to see the client PC usage and have sometimes seen the local CPU and particularly the I/O go up significantly during compiles, this is because parts of the compilat ...
What does "datisaq" means here :?
anbu
Premium Member
Premium Member
Posts: 596
Joined: Sat Feb 18, 2006 2:25 am
Location: india

Post by anbu »

datisaq is the username.

ArndW was replying to datisaq's post above
You are the creator of your destiny - Swami Vivekananda
agpt
Participant
Posts: 151
Joined: Sun May 16, 2010 12:53 am

Post by agpt »

anbu wrote:datisaq is the username.

ArndW was replying to datisaq's post above
Oh Sorry... I didn't notice that.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

I would still like to have an answer to the question about whether it always takes four hours or whether this was a one-off.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Me, too. :?
-craig

"You can never have too many knives" -- Logan Nine Fingers
ethelvina
Participant
Posts: 17
Joined: Sun Mar 30, 2008 10:05 pm

Post by ethelvina »

Only this job takes a long time to compile, also this job always takes about 4 hrs to compile.Even import of this job takes about 30 mins.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

OK... can you describe the job in more detail? For example, how many transformers does it include? I'm guessing... more than one.
-craig

"You can never have too many knives" -- Logan Nine Fingers
ethelvina
Participant
Posts: 17
Joined: Sun Mar 30, 2008 10:05 pm

Post by ethelvina »

There are 3 transformers and 3 lookup stages.Input dataset has about 1145 columns and there are 11 output datasets, almost all of the input columns are required.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Unless your server is an old 386SX processor with the turbo switch turned off, or it is extremely overloaded with other process then 3 transforms and 3 lookups shouldn't take that long!

As in my earlier response to datiseq, please measure your system loads to see if you can identify the bottleneck.
Post Reply