
Removal of Junk Characters using parallel transformer

Posted: Tue Jul 22, 2008 12:39 am
by Kbhujad
We have created a DataStage parallel job to remove junk characters from a sequential file and load the data into a Netezza table.

We are able to remove the junk characters with a BASIC Transformer, using the Convert and Oconv functions.

But we think the BASIC Transformer might cause a performance issue.

Can anyone suggest an alternative way to remove the junk characters, using a parallel Transformer or anything else?

Posted: Tue Jul 22, 2008 12:49 am
by keshav0307

Posted: Tue Jul 22, 2008 1:03 am
by ray.wurlod
Make very sure that they're junk! Don't assume anything.

Posted: Tue Jul 22, 2008 2:31 am
by Kbhujad
We have confirmed that these characters are control characters.

Posted: Tue Jul 22, 2008 8:32 am
by miwinter
When you say 'control characters' - what do you mean exactly? (newline characters? tabs?) Examples please.

Posted: Tue Jul 22, 2008 11:44 pm
by Kbhujad
Control characters like ^A, ^Z, etc. should be removed.

The control characters with ASCII values like "\000\001\002\003\004\005\006\007\010\011\013\014\015\016\017\020\021\022\023\024\025\026\027\030\031\032\033\034\035\036\037\177\377"
should be removed from the input field.
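
To pin down exactly what "removed" means for that list, here is a minimal sketch in Python. It is not DataStage code, only a reference for checking results or pre-cleaning the file. The names JUNK_OCTAL, JUNK_BYTES and strip_junk are hypothetical, and it assumes the field is handled as raw bytes, since \377 is outside the ASCII range.

```python
# Octal escapes copied from the list above. \012 (line feed) is not in the
# list, so line feeds are left untouched.
JUNK_OCTAL = (
    r"\000\001\002\003\004\005\006\007\010\011\013\014\015\016\017"
    r"\020\021\022\023\024\025\026\027\030\031\032\033\034\035\036\037"
    r"\177\377"
)

# Convert each three-digit octal escape into the byte value it denotes.
JUNK_BYTES = bytes(int(code, 8) for code in JUNK_OCTAL.split("\\") if code)


def strip_junk(field: bytes) -> bytes:
    """Return the field with every listed byte deleted (hypothetical helper)."""
    # With table=None, bytes.translate only deletes the given bytes.
    return field.translate(None, JUNK_BYTES)
```

Note that tab (\011) and carriage return (\015) are in the list, so they are stripped along with ^A and ^Z; only the line feed survives.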