Japanese character conversion

DatastageCoEUser
Premium Member
Posts: 36
Joined: Mon Mar 03, 2014 3:09 am

Post by DatastageCoEUser »

We are facing a Japanese character issue in DataStage during a data load. Our source is an EBCDIC file containing Japanese (Hiragana) data. When we load the data into a flat file, we do get Japanese characters, but in a different script (Katakana). We have tried every code page (NLS code map) available in DataStage that supports Japanese character conversion, but we have not been able to resolve the problem.

Can someone help?
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany

Post by ArndW »

The Hiragana and Katakana code points are distinct, and there are even differences in how many characters each has. Thus it is unlikely that a hiragana "Ha" is transformed to a katakana "Ha".
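To illustrate the point about distinct code points, here is a minimal Python sketch using Unicode as the reference character set. It is only an illustration: the helper name `assigned_in_block` is made up for this example, and the exact counts depend on the Unicode version bundled with your Python, so none are quoted here.

```python
import unicodedata

hira_ha = "\u306f"  # Hiragana "ha"
kata_ha = "\u30cf"  # Katakana "ha"

def assigned_in_block(start, end):
    """Count code points in [start, end] that have a Unicode character name."""
    return sum(
        1
        for cp in range(start, end + 1)
        if unicodedata.name(chr(cp), None) is not None
    )

# Unicode block ranges: Hiragana U+3040..U+309F, Katakana U+30A0..U+30FF
hira_count = assigned_in_block(0x3040, 0x309F)
kata_count = assigned_in_block(0x30A0, 0x30FF)

# The two "ha" characters are separate code points with separate names.
print(hex(ord(hira_ha)), unicodedata.name(hira_ha))
print(hex(ord(kata_ha)), unicodedata.name(kata_ha))
print("assigned Hiragana:", hira_count, "assigned Katakana:", kata_count)
```

A correct conversion keeps each character in its own script, so seeing Katakana where Hiragana is expected points at a decoding problem rather than at a deliberate character mapping.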

You need to find out from the provider of your EBCDIC Japanese source exactly which character set is being used. There are numerous possible source encodings, and the most likely cause of your problem is that the wrong one has been selected, which can result in incorrect character conversions.
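The effect of picking the wrong source encoding can be sketched in a few lines of Python. This is an assumption-laden stand-in, not your actual case: as far as I know the Python standard library does not ship Japanese EBCDIC codecs, so EUC-JP and Shift_JIS are used here purely to show the mechanism. The variable names are invented for the example.

```python
import unicodedata

source_text = "\u306f"  # Hiragana "ha"

# Pretend this is what the source system wrote (stand-in encoding: EUC-JP).
raw = source_text.encode("euc_jp")

# Decoding with the encoding that was actually used round-trips cleanly.
right = raw.decode("euc_jp")

# Decoding the same bytes with a different Japanese encoding "works"
# without raising an error, but yields different characters.
wrong = raw.decode("shift_jis")

print("bytes:", raw.hex())
print("right:", [unicodedata.name(ch) for ch in right])
print("wrong:", [unicodedata.name(ch) for ch in wrong])
```

In this stand-in case the wrongly decoded output contains a half-width Katakana letter, even though the source held only Hiragana. No error is raised, which is why this kind of mismatch is easy to miss until someone reads the output.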
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

We are aware of at least fourteen different ways of encoding Japanese characters. As Arndo-san noted, you must ascertain precisely how these data are encoded.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.