handling invalid data in dataset

Poovalingam
Participant
Posts: 111
Joined: Mon Nov 30, 2009 7:21 am
Location: Bangalore

handling invalid data in dataset

Post by Poovalingam »

Hi gurus,
I need a suggestion for a job design where the source is a Data Set and the target is Oracle.
The metadata of the Data Set and the Oracle table is the same.
When the Data Set is created and a value is invalid for its column's data type, asterisks (******) are stored in its place.
Before the data reaches Oracle, how can I stop those invalid records from propagating into it?

thanks in advance,
Poova.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

How the heck did you get invalid data into a Data Set in the first place?!!

These should be rejected in the job that populates the Data Set.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
gssr
Participant
Posts: 243
Joined: Fri Jan 09, 2009 12:51 am
Location: India

Re: handling invalid data in dataset

Post by gssr »

You can filter out the records with invalid data (*****) using a Filter stage.
RAJ
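
For anyone who wants to see the filtering idea spelled out, here is a minimal sketch in Python rather than DataStage syntax. It assumes an invalid value shows up as a field made up entirely of '*' characters, as described in the original post; the column names (`id`, `load_ts`) and the helper names are made up for illustration. A Filter stage would express the same test per column in its where clause.

```python
import re

# Assumption: an invalid value appears as a field consisting only of '*'.
ASTERISK_ONLY = re.compile(r"^\*+$")


def is_placeholder(value):
    """Return True if value is a non-empty string made up only of '*'."""
    return isinstance(value, str) and bool(ASTERISK_ONLY.match(value.strip()))


def split_rows(rows):
    """Split rows (dicts) into (clean, rejected) lists.

    A row is rejected if any of its fields is an asterisk placeholder.
    """
    clean, rejected = [], []
    for row in rows:
        if any(is_placeholder(v) for v in row.values()):
            rejected.append(row)
        else:
            clean.append(row)
    return clean, rejected


# Hypothetical sample rows; the column names are illustrative only.
rows = [
    {"id": "1", "load_ts": "2010-01-12 10:52:00"},
    {"id": "2", "load_ts": "*******************"},
]
clean, rejected = split_rows(rows)
```

Only `clean` would go on to the Oracle target; `rejected` can be written to a file for inspection. Note that this treats the symptom only, so the job that builds the Data Set still needs fixing.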
sudhanmaad
Participant
Posts: 3
Joined: Tue Jan 12, 2010 10:52 pm
Location: chennai

Post by sudhanmaad »

Maybe the conversion you have done is wrong, or the format is wrong (for example, you have converted to a Timestamp but the metadata definition says Date). These are the scenarios where data appears as '**********' in a Data Set. If this is not the case, then the data itself is wrong; please filter it properly.
Poovalingam
Participant
Posts: 111
Joined: Mon Nov 30, 2009 7:21 am
Location: Bangalore

Post by Poovalingam »

Thanks all.

To prepare the Data Set, the source is a fixed-width file and we read all of the data as Char. We then used a Transformer to apply some transformations and the required type conversions. But some data reaches the Data Set as ***. :?

Do we need to write a filter condition for every column that I suspect may contain a bad value?

I would also like to know:
1. Doesn't the Transformer reject a record when type conversion or transformation fails on invalid data?
2. When invalid data (***) enters the Data Set, will DataStage throw a warning?

Regards,
Poova.
priyadarshikunal
Premium Member
Posts: 1735
Joined: Thu Mar 01, 2007 5:44 am
Location: Troy, MI

Post by priyadarshikunal »

It seems the data is being padded with '*', or *** is being assigned during null handling. Check the data flow.
Priyadarshi Kunal

Genius may have its limitations, but stupidity is not thus handicapped. :wink:
ETLJOB
Participant
Posts: 87
Joined: Thu May 01, 2008 1:15 pm
Location: INDIA

Post by ETLJOB »

You are probably losing some valid data because of improper type conversion, null handling, etc., and calling it "invalid" data. Thoroughly check your entire flow again.
ETLJOB
Participant
Posts: 87
Joined: Thu May 01, 2008 1:15 pm
Location: INDIA

Post by ETLJOB »

Also, let us know what your log says.
sudhanmaad
Participant
Posts: 3
Joined: Tue Jan 12, 2010 10:52 pm
Location: chennai

Post by sudhanmaad »

Just try mapping all the columns to the Data Set without any transformation, and check whether the data still shows '********'.
nagarjuna
Premium Member
Posts: 533
Joined: Fri Jun 27, 2008 9:11 pm
Location: Chicago

Post by nagarjuna »

I think something is wrong in the null handling of the timestamp fields. Specify the null field value. Is there any reason why you are not reading with the correct data types in the Sequential File stage? It would automatically reject invalid data.
Nag
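
The idea of rejecting bad rows at conversion time, instead of letting placeholders through, can be sketched roughly as follows. This is plain Python, not DataStage, and everything in it is an assumption made for illustration: the column names, the timestamp format, and the helper names. It only shows the pattern of converting each raw Char field to its target type and routing any row that fails to a reject list.

```python
from datetime import datetime


def to_int(text):
    """Convert a raw text field to int; raises ValueError if invalid."""
    return int(text.strip())


def to_timestamp(text):
    """Convert a raw text field to datetime; raises ValueError if invalid."""
    # Assumed format; the thread does not state the real one.
    return datetime.strptime(text.strip(), "%Y-%m-%d %H:%M:%S")


# Hypothetical column layout: column name -> converter.
CONVERTERS = {"id": to_int, "load_ts": to_timestamp}


def convert_or_reject(raw_rows):
    """Return (converted, rejects); rejects holds (raw_row, reason) pairs."""
    converted, rejects = [], []
    for raw in raw_rows:
        try:
            converted.append(
                {col: conv(raw[col]) for col, conv in CONVERTERS.items()}
            )
        except (ValueError, KeyError) as exc:
            # The whole row is rejected, so no placeholder reaches the target.
            rejects.append((raw, str(exc)))
    return converted, rejects


raw_rows = [
    {"id": "1", "load_ts": "2010-01-12 22:52:00"},
    {"id": "2", "load_ts": "2010-13-40 00:00:00"},  # invalid month and day
    {"id": "x", "load_ts": "2010-01-12 22:52:00"},  # non-numeric id
]
converted, rejects = convert_or_reject(raw_rows)
```

Keeping the failure reason next to each rejected row makes it easier to tell a genuinely bad source value from a wrong format string or a null-handling gap.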