Issue with Parallel loads with logical nodes

vskr72
Premium Member
Posts: 128
Joined: Wed Apr 28, 2004 9:36 pm

Post by vskr72 »

We have a simple job that does a direct table-to-table dump with a Copy stage in between. This is what we did:
- Created 2 logical nodes in the config file and used that config file in the current job
- Added the 2 nodes in the node map constraint field
- Left the rest of the options at their defaults

When I run this job, the number of records loaded into the target is twice what the source table has. If I increase the node count to 4, the target gets 4 times the source count.

I am new to DS. Can anyone tell me what I need to configure in this case?

Satish
mavrick21
Premium Member
Posts: 335
Joined: Sun Apr 23, 2006 11:25 pm

Post by mavrick21 »

"Added the 2 nodes in the node map constraint field" -> Do you have any particular reason for adding this?

Can you remove the nodes from the node map constraint field and check again?
vskr72
Premium Member
Posts: 128
Joined: Wed Apr 28, 2004 9:36 pm

Post by vskr72 »

Even if I remove the nodes from the node map constraint, the job still duplicates the rows according to the number of nodes.
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Then you've got a partitioning issue. Can you describe your job design - stages, partitioning settings per stage, etc?
-craig

"You can never have too many knives" -- Logan Nine Fingers
vskr72
Premium Member
Posts: 128
Joined: Wed Apr 28, 2004 9:36 pm

Post by vskr72 »

This is my job design:

Oracle Ent Stage --> Copy --> Seq.File

Oracle Stage:
- SQL override with the condition rownum < 50000
- Buffering: default
- Combinability: Auto
- Preserve partitioning: default

Copy Stage:
- Preserve partitioning: propagate
- Partitioning: Auto

Seq File:
- Default settings

Thanks
vskr72
Premium Member
Posts: 128
Joined: Wed Apr 28, 2004 9:36 pm

Post by vskr72 »

I also have this set in the job properties for $APT_ORACLE_LOAD_OPTIONS:

OPTIONS(DIRECT=FALSE,PARALLEL=FALSE)
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Why have 'load options' when there is no loading going on?

If your Oracle stage's Execution Mode is parallel, try setting it to sequential.
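
The symptom described above (2 nodes give 2x the rows, 4 nodes give 4x) is what you would expect if every node runs the same unpartitioned query. The following is only a toy Python model of that assumption, not DataStage code; `read_source`, `node_count`, and `execution_mode` are hypothetical names made up for the illustration.

```python
def read_source(source_rows, node_count, execution_mode):
    """Toy model of a source stage (hypothetical, not a DataStage API).

    Assumption: in "parallel" mode each node runs the same unpartitioned
    query and so returns the full result set; in "sequential" mode a
    single reader runs the query once.
    """
    if execution_mode == "sequential":
        readers = 1
    elif execution_mode == "parallel":
        readers = node_count
    else:
        raise ValueError(f"unknown execution mode: {execution_mode}")

    output = []
    for _ in range(readers):
        # Every reader fetches the complete result set.
        output.extend(source_rows)
    return output


if __name__ == "__main__":
    source = list(range(1000))  # stand-in for the rows the query returns
    for nodes in (2, 4):
        parallel_count = len(read_source(source, nodes, "parallel"))
        sequential_count = len(read_source(source, nodes, "sequential"))
        print(nodes, "nodes:", parallel_count, "rows parallel,",
              sequential_count, "rows sequential")
```

Under this assumption the row count scales with the number of readers, which is why running the source stage sequentially is a reasonable first thing to try.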
udayk_2007
Participant
Posts: 72
Joined: Wed Dec 12, 2007 2:29 am

Post by udayk_2007 »

Remove the Copy stage if it is not required and try running the job. Check whether you get the same result.
Sainath.Srinivasan
Participant
Posts: 3337
Joined: Mon Jan 17, 2005 4:49 am
Location: United Kingdom

Post by Sainath.Srinivasan »

View the run statistics in the Director with the rows-per-node mode enabled. That will show you which stage is increasing the volume.
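
Once you have the per-node counts, the check itself is simple: sum the nodes for each stage and look for the first stage whose total exceeds the source row count. A minimal Python sketch of that comparison follows; the function name and the sample numbers are hypothetical and were typed in by hand for illustration, they are not output from this job or from any DataStage tool.

```python
def first_inflating_stage(rows_per_node_by_stage, expected_rows):
    """Return (stage_name, total_rows) for the first stage whose summed
    per-node row count exceeds expected_rows, or None if no stage does.

    rows_per_node_by_stage: list of (stage_name, [rows_on_node0, ...])
    in job order, copied by hand from the run statistics.
    """
    for stage_name, per_node in rows_per_node_by_stage:
        total = sum(per_node)
        if total > expected_rows:
            return stage_name, total
    return None


if __name__ == "__main__":
    # Hypothetical figures for a 2-node run with 49,999 source rows.
    stats = [
        ("Oracle", [49999, 49999]),
        ("Copy", [49999, 49999]),
        ("SeqFile", [99998]),
    ]
    print(first_inflating_stage(stats, expected_rows=49999))
```

In the made-up figures above, the very first stage already carries double the expected rows, which would point at the source read rather than at the Copy stage.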