Routine to View Data flow between two stages

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

jweir
Participant
Posts: 134
Joined: Wed Aug 04, 2010 12:31 pm

Post by jweir »

Hello all,

Is there a Transformer routine that lets me view the data flowing between two stages?
Jweir

--- If strength were all, tiger would not fear scorpion.
jwiles
Premium Member
Posts: 1274
Joined: Sun Nov 14, 2004 8:50 pm

Post by jwiles »

Define more clearly what you mean by viewing the data flow.

The Peek stage can be used for displaying data in the job log or to a file.

You can write logic within a transformer that can output formatted data to a link which is connected to a Peek stage.

Regards,
- james wiles


All generalizations are false, including this one - Mark Twain.
jweir
Participant
Posts: 134
Joined: Wed Aug 04, 2010 12:31 pm

Post by jweir »

Sorry for the general statement.

Here is what I would like to do: capture the row counts for the output of every stage inside a parallel job. However, I can't alter the parallel job itself to achieve this, so I would like to add a routine or something similar at the sequence level.

Any thoughts/ideas?
Jweir

jwiles
Premium Member
Posts: 1274
Joined: Sun Nov 14, 2004 8:50 pm

Post by jwiles »

You can get that information through BASIC routines called from the sequence. Kim Duke's ETLStats package can gather it as well.
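As an illustration of collecting a link row count from outside the job, here is a minimal Python sketch that shells out to the `dsjob -linkinfo` command line instead of using a BASIC routine (a swapped-in technique, not the routine approach itself). The project, job, stage, and link names are placeholders, and the `Link Row Count` label the parser looks for is an assumption about the command's output; check it against what your installation actually prints.

```python
import re
import subprocess

# Assumed label in the `dsjob -linkinfo` output; verify on your own system.
ROW_COUNT = re.compile(r"Link Row Count\s*:\s*(\d+)")


def link_info_command(project, job, stage, link):
    """Build the dsjob argument list for one link of one stage."""
    return ["dsjob", "-linkinfo", project, job, stage, link]


def parse_row_count(text):
    """Return the row count found in the command output, or None if absent."""
    match = ROW_COUNT.search(text)
    return int(match.group(1)) if match else None


def link_row_count(project, job, stage, link):
    """Run dsjob for a finished job and return the link's row count, or None."""
    result = subprocess.run(
        link_info_command(project, job, stage, link),
        capture_output=True,
        text=True,
        check=False,  # inspect the output rather than raising on the exit code
    )
    return parse_row_count(result.stdout)
```

A sequence could call something like this from an Execute Command activity after the parallel job finishes, once per stage and link, so the parallel job itself stays untouched.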

Regards,
- james wiles


ajay.vaidyanathan
Participant
Posts: 53
Joined: Fri Apr 18, 2008 8:13 am
Location: United States

Post by ajay.vaidyanathan »

Hi,
You can do this in a BASIC routine, which can retrieve the number of records processed per link in your job. Call the routine from your job or sequence, then pass the result to your parallel job.

Regards,
Ajay
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Given that you already are in a Transformer stage, why not just use an additional output link? This could even feature a constraint driven by a job parameter that indicates whether or not you want to capture data and, if so, how many rows.
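The logic of such a constraint can be modelled outside DataStage. The Python sketch below is only an illustration of the idea, not Transformer syntax: `capture_enabled` and `max_rows` stand in for hypothetical job parameters, and the row number plays the role of the Transformer's input row counter. In a parallel job that counter is tracked per partition, so the real sample may hold up to `max_rows` rows from each partition.

```python
def capture_row(row_number, capture_enabled, max_rows):
    """Model of the constraint on the extra output link."""
    return capture_enabled and row_number <= max_rows


def split_rows(rows, capture_enabled, max_rows):
    """Send every row to the main link; copy the first max_rows to the sample link."""
    main, sample = [], []
    for row_number, row in enumerate(rows, start=1):
        main.append(row)  # the main output link is unaffected
        if capture_row(row_number, capture_enabled, max_rows):
            sample.append(row)  # extra link, e.g. feeding a Peek stage
    return main, sample
```

With capture switched off the sample link receives nothing, so the extra link costs little when left in the job permanently.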
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.