i want to capture bad records

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
chandra
Participant
Posts: 88
Joined: Sun Apr 02, 2006 6:50 pm
Location: India

i want to capture bad records

Post by chandra »

how can i caputer the bad records from dataset.
suppose i have 10 records .
6th and 7th records are bad records .
how to achive this any clues?
chandra ,
Hyd
kris007
Charter Member
Charter Member
Posts: 1102
Joined: Tue Jan 24, 2006 5:38 pm
Location: Riverside, RI

Post by kris007 »

It depends on how you define your bad records. Usually, you define a reject link to capture all reject data which are defined based upon the constraints as per your business requirements. In your case, from the example you have given,, it depends on how you identify your seventh and eighth records as bad data.
chandra
Participant
Posts: 88
Joined: Sun Apr 02, 2006 6:50 pm
Location: India

Post by chandra »

i gave you example of 6th and 7th .
in this case i dnt want to use any constrains .
i want to capture the bad records from raw data.
chandra ,
Hyd
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

You don't use DataStage.

Constraints are DataStage's way of directing data flows.

To do it purely in source requires a HarryPotter stage. :lol:
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne
Contact:

Post by vmcburney »

Have a look at my blog on the data quality firewall within a DataStage job. This lets you capture bad records as you process the raw data and deliver it to your datasets. If you want to identify bad records within the source databases you will need some type of profiling tool such as ProfileStage.
kumar_s
Charter Member
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

As mentioned, define what is 'bad record' in you case?
Is it alwasy 6th and 7th in order?
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
rony_daniel
Participant
Posts: 36
Joined: Thu Sep 01, 2005 5:44 am
Location: Canada

Post by rony_daniel »

kumar_s wrote:As mentioned, define what is 'bad record' in you case?
Is it alwasy 6th and 7th in order?
Chandra,

Bad Records is purely a relative term. It depends on your business requirements. The record which one business calls as bad record may not be a bad record for another. Some of the common scenariaos would be like :-
1) If your source is a fixed length file (say 100) and if the 6th and 7th records are having a length of 95 and 105, then you may call it as BAD.
2) If you are loading the data into a database where you have defined some of the columns as Date datatype and your 6th and 7th record has values in these fields as "30/Feb/2006" and "31/Apr/2006", then you may call it as BAD. Similarly for numeric datatypes...
Post Reply