Link Partitioner

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
phillip.small
Participant
Posts: 29
Joined: Mon Aug 16, 2004 8:12 am

Link Partitioner

Post by phillip.small »

I am trying to use a Link Paritioner to see if it would speed up our 2 million row load. However, I am not sure how to set it up. Where do I link? I am getting an error. Is the link partitioner output from the transformer stage or the input file stage?
ketfos
Participant
Posts: 562
Joined: Mon May 03, 2004 8:58 pm
Location: san francisco
Contact:

Post by ketfos »

Hi,
The Partitioner stage partitions data, it is processed in parallel, then the Collector stage collects it together again before writing it to a single target.

A sequential file can be input to Link Partition Stage. which can take up to 64 links and colects data from this link (Link Collector) and route them to single output link.

Please refer to datastage help on this.

What error message are you getting?

Ketfos
tonystark622
Premium Member
Premium Member
Posts: 483
Joined: Thu Jun 12, 2003 4:47 pm
Location: St. Louis, Missouri USA

Post by tonystark622 »

When I've used them in the past, it's been something like this:

Code: Select all

                      ->XFormer--->other       -->
                    /               processing     \
                   /                                 \
--->XFormer-->Link ---->XFormer--->other       ---->Link       --->XFormer
              Partitioner           processing      Collector
                   \                                 /
                     \                             /
                       ->XFormer--->other       --> 
                                     processing
I have some with sort stages, and transformers and aggregators. Typically, you will be doing the same processing on each "leg" of the split processes.

Hope this helps,
Tony
phillip.small
Participant
Posts: 29
Joined: Mon Aug 16, 2004 8:12 am

Post by phillip.small »

ketfos wrote:Hi,
The Partitioner stage partitions data, it is processed in parallel, then the Collector stage collects it together again before writing it to a single target.

A sequential file can be input to Link Partition Stage. which can take up to 64 links and colects data from this link (Link Collector) and route them to single output link.

Please refer to datastage help on this.

What error message are you getting?

Ketfos
The error that I am getting is:

The Link Collector does not support in-process active-to-active inputs or outputs. (Jobname.linkcollectorname).
chucksmith
Premium Member
Premium Member
Posts: 385
Joined: Wed Jun 16, 2004 12:43 pm
Location: Virginia, USA
Contact:

Post by chucksmith »

In Designer, edit your job properties (the yellow icon). On the performance tab, select the Inter process button, the recompile.
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne
Contact:

Post by vmcburney »

Is this job design any faster or more effective than a non partitioned job? On Unix systems you need to run seperate instances of jobs to get true parallel performance, this job may have several streams but it will still be allocated to just one CPU. The cost of splitting and joining may be greater then the benefits of having three streams.

I'd be interested if you could run a test comparing the performance of the partitioned job to a version of the job that has a single stream.
Post Reply