Remove bad rows before loading

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
sam334
Premium Member
Premium Member
Posts: 145
Joined: Mon Aug 26, 2013 7:42 pm

Remove bad rows before loading

Post by sam334 »

All need a help in removing bad rows while loading.

We have a job which reads a pipe delimited file and load in DB2 database. Issue is that, sometimes in one or two records before pipe we are getting # sign. Datastage assuming it is a delimiter and aborting the job.

Can we do something to reject the # included rows whenever it comes, by which the job will not abort.

This is a source system issue but unfortunately can not do anything with it.

Thanks..
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Can you post the 'abort' you are getting? Don't understand why it would think a # is a delimiter so wondering if the actual abort text might help clear that up.
Last edited by chulett on Thu Mar 05, 2015 7:18 pm, edited 1 time in total.
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Assuming you're using a Sequential File stage you could use a Filter command to drop or edit the # signs out of the data.

For example to bypass those records, use the grep command, while to alter them use the sed or awk command.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply