Routine to View Data flow between two stages

Posted: Thu Apr 14, 2011 8:58 am
by jweir
Hello all,

Is there a transformer routine out there that would let me view the data flowing between two stages?

Posted: Thu Apr 14, 2011 9:35 am
by jwiles
Define more clearly what you mean by viewing the data flow.

The Peek stage can be used for displaying data in the job log or to a file.

You can write logic within a transformer that can output formatted data to a link which is connected to a Peek stage.

Regards,

Posted: Thu Apr 14, 2011 12:05 pm
by jweir
Sorry for the general statement.

Here is what I would like to do: capture the row counts of the output of every stage inside a parallel job. However, I can't alter the parallel job itself to achieve this, so I would like to add a routine or something similar at the sequence level.

Any thoughts/ideas?

Posted: Thu Apr 14, 2011 1:10 pm
by jwiles
You can get that information through BASIC routines inside the sequencer. Also, Kim Duke's ETLStats package can gather that as well.

Regards,

Posted: Thu Apr 14, 2011 1:43 pm
by ajay.vaidyanathan
Hi,
You can achieve this in a BASIC routine. The routine can retrieve the number of records processed on each link in your job. Call the routine from your job or sequence and then pass the result to your parallel job.

Regards,
Ajay
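
The BASIC routine approach mentioned in this thread could look roughly like the sketch below. It is only an illustration, not a tested implementation: it assumes a server routine with a single argument named JobName (a hypothetical name), that it is called from the sequence after the parallel job has finished, and that DSJ.STAGELIST returns the stages you care about in your version. The log label "LinkRowCounts" is also made up for the example.

```
$INCLUDE DSINCLUDE JOBCONTROL.H

* Attach to the finished job; JobName is the (hypothetical) routine argument.
hJob = DSAttachJob(JobName, DSJ.ERRNONE)
If Not(hJob) Then
   Call DSLogWarn("Could not attach to job " : JobName, "LinkRowCounts")
   Ans = -1
End Else
   * Stage and link names come back as comma-separated lists.
   StageList = DSGetJobInfo(hJob, DSJ.STAGELIST)
   NumStages = Dcount(StageList, ",")
   For s = 1 To NumStages
      StageName = Field(StageList, ",", s)
      LinkList = DSGetStageInfo(hJob, StageName, DSJ.LINKLIST)
      NumLinks = Dcount(LinkList, ",")
      For l = 1 To NumLinks
         LinkName = Field(LinkList, ",", l)
         * A link may be reported by both the stage it leaves and the stage it enters.
         RowCount = DSGetLinkInfo(hJob, StageName, LinkName, DSJ.LINKROWCOUNT)
         Call DSLogInfo(StageName : "." : LinkName : ": " : RowCount : " rows", "LinkRowCounts")
      Next l
   Next s
   ErrCode = DSDetachJob(hJob)
   Ans = 0
End
```

Because the routine only reads job metadata after the run, the parallel job itself does not have to be changed.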

Posted: Thu Apr 14, 2011 4:42 pm
by ray.wurlod
Given that you are already in a Transformer stage, why not just use an additional output link? It could even have a constraint driven by a job parameter that indicates whether or not you want to capture data and, if so, how many rows.
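
As a rough illustration, the constraint on such an extra output link might look like the expression below. CaptureRows is a hypothetical integer job parameter (0 meaning "capture nothing"), not something defined in this thread. Note that in a parallel Transformer @INROWNUM is counted per partition, so the total number of rows captured can be up to CaptureRows times the number of partitions.

```
CaptureRows > 0 And @INROWNUM <= CaptureRows
```

Setting CaptureRows to 0 at run time switches the capture link off without editing the job again.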