how to find out what a partition a dataset uses

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
ruf888
Participant
Posts: 20
Joined: Wed May 13, 2009 1:14 am
Location: Germany

how to find out what a partition a dataset uses

Post by ruf888 »

Hi, all

I'd like to know if there is a way to find out which partition a datasets is using ?
BI-RMA
Premium Member
Premium Member
Posts: 463
Joined: Sun Nov 01, 2009 3:55 pm
Location: Hamburg

Post by BI-RMA »

Persistent in Job log:
Include Variable APT_RECORD_COUNT and set to true. This will report how many rows have traveled which links per node.

In Director job monitor:
In Director when You open a Job Monitor You should see the number of nodes the job used during the last run. Right click on a stage that shows something like "<Stage name> x2" and activate "Show Instances". After that You can see the link statistics per node.
"It is not the lucky ones are grateful.
There are the grateful those are happy." Francis Bacon
ruf888
Participant
Posts: 20
Joined: Wed May 13, 2009 1:14 am
Location: Germany

Post by ruf888 »

Hi, Roland,

I think you misunderstood me. I'd like to know if the datasets is partitioned by which key, not how many nodes it uses.

For example, if I partitioned the datasets by type/model, where can I identify that this ds is partitioned by type/model except in the job itself.
If the job always uses the same partition, then must go back several jobs to find out which partition key it has used.
Is there any way to find it out without lookingat the jobs ?
Jboyd
Participant
Posts: 15
Joined: Mon Mar 14, 2011 12:55 pm

Post by Jboyd »

I believe setting the APT_DUMP_SCORE environment variable to true will assist with this.
Post Reply