Generate report on data movement from source to target

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
mchivuku
Participant
Posts: 21
Joined: Sat Feb 04, 2006 7:38 am

Generate report on data movement from source to target

Post by mchivuku »

Hi,

My requirement is to generate a report that gives the following details for each job that runs on the server.
Number of rows read from source - a
Number of rows inserted into target - b
Number of rows rejected - c
So in any case a = b+c

This kind of reconciliation is a part of the process in our project.
If this report is available in graphical format, then that would be appreciated.

I have tried the options in dsjobs command,DSJobReport in after job sub-routine and the metadata workbench.

But the end result is expected to be something like the DS Job Monitor which shows the data flow through the various stages in the job.

Please advice me on how to proceed.

Thanks,
Murali
kduke
Charter Member
Charter Member
Posts: 5227
Joined: Thu May 29, 2003 9:47 am
Location: Dallas, TX
Contact:

Post by kduke »

Unless you have a reject link then you will never capture these numbers. There are lots of links which drop rows like remove dups. So not all jobs will add up.

EtlStats will save you a lot time building your own solution.
Mamu Kim
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

You may be able to compute "c" from "a-b" but as noted it's not always true rejects. You'd need to specifically capture those and even then things may not foot.
-craig

"You can never have too many knives" -- Logan Nine Fingers
kduke
Charter Member
Charter Member
Posts: 5227
Joined: Thu May 29, 2003 9:47 am
Location: Dallas, TX
Contact:

Post by kduke »

Some lookups can create dups if more than one row comes back from lookup.
Mamu Kim
nagarjuna
Premium Member
Premium Member
Posts: 533
Joined: Fri Jun 27, 2008 9:11 pm
Location: Chicago

Post by nagarjuna »

we are doing similar sort of thing in our project ( Auto balancing control ABAC ) ..we are capturing the reject count , input count , o/p count .If o/p is not equal to input + reject then it means some records are dropped while passing through some of the stages . Again this will depends on what level you want to reconcile .
Nag
mchivuku
Participant
Posts: 21
Joined: Sat Feb 04, 2006 7:38 am

Post by mchivuku »

Thanks for all your replies!!!
I definitely agree only if rejects are designed accordingly, we will be able to reconcile source,target and rejects.
Our jobs are designed and tested fine in such a way that source is always an addition of target count and reject count.
Now the issue is only with the generation of an appropriate report.
For eg : In the job monitor,I am able to see the data count processed by each stage in the job.Hence we have renamed the appropriate stages to give more meaning for our reconciliation.

Please advice me on how a graphical report can be generated which gives me the data flow through each stage in the jobs.

Thanks,
Murali
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Run in Director, press the Print Screen when it's done and paste into MS Paint or similar. I don't know any other graphical report.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply