DataStage/QualityStage 8.0 : Support for CJK languages

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
byk
Participant
Posts: 8
Joined: Wed Mar 07, 2007 8:43 am

DataStage/QualityStage 8.0 : Support for CJK languages

Post by byk »

Please help me understand the support for CJK (Chinese-Japanese-Korean) languages.
If you have any ibm website link for the same, please let me know.

Thanks
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

In order to understand how DataStage NLS uses CJK, you need to start off with understanding multibyte implementations at a system level; the rest is just an implementation question. There are some excellent reference works available (hardcover ones) that provide a very solid foundation in the basic concepts and methods.
byk
Participant
Posts: 8
Joined: Wed Mar 07, 2007 8:43 am

Post by byk »

Thanks. What I a more interested is to know it from DataStage perspective. For e.g.

1. Any special settings (apart from NLS) in DataStage
2. Known issues where certain scripts are not supported etc.
3. Any special consideration for stages of QualityStage
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

I can't think of anything offhand that isn't in the manuals and don't have recent QualityStage experience. We certainly had a number of teething issues with Japanese, Chinese and Korean implementations years ago but the installed bases and experience levels in those languages is quite widespread/high now so any inherent problems will have been dealt with.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

There is a DataStage NLS manual. Read that. The CJK characters appear in each of the appropriate Chinese, Japanese and Korean character maps.

Note that server jobs use UV-UTF8 while parallel jobs use ICU (a "true" 16-bit implementation of Unicode).
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply