Hi!
My data contains characters that DS cannot resolve, and these characters are replaced by "?", which I can see. But it is not really a "?": in a Transformer I tried to find it with the INDEX BASIC function and it returned 0.
Does anybody have a clue how to remove these characters before they are inserted into the DB?
NLS and ? Problem
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Different sets of characters are encoded according to different standards. For example, Chinese characters may be encoded according to standards called BIG5 or GB2312. There are many different standards for encoding Japanese characters. DataStage provides maps for most of the commonly encountered standards; the DataStage NLS manual shows how you can build your own maps if your data are encoded according to a standard that is not already supported. What is vital is that you know how your data are encoded.
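To see why knowing the encoding matters, here is a small sketch in Python (not DataStage; Python's built-in codecs are used purely as an analogy for NLS maps, and the sample text is an arbitrary choice). The same two Chinese characters become different bytes under BIG5 and GB2312, and reading the bytes with the wrong standard gives the wrong text.

```python
# Illustration only: Python codecs standing in for DataStage NLS maps.
text = "中文"  # arbitrary sample text

# The same characters are stored as different bytes under each standard.
big5_bytes = text.encode("big5")
gb_bytes = text.encode("gb2312")

# Decoding with the standard the data was really written in recovers the text.
right = big5_bytes.decode("big5")

# Decoding the same bytes with the wrong standard does not.
# errors="replace" keeps the call from raising if a byte pair is unmappable.
wrong = big5_bytes.decode("gb2312", errors="replace")

print(big5_bytes != gb_bytes)  # the byte sequences differ
print(right == text)           # correct map, correct text
print(wrong == text)           # wrong map, wrong text
```

The point is that nothing in the bytes themselves says which standard was used, so the correct map has to be asserted by whoever knows the source.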
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
"?" is NOT the invalid character. "?" is a special Unicode character that DataStage uses when it cannot convert the invalid character into Unicode (its own UTF-8 encoding of Unicode) based on your assertion of how the external character set is encoded. Because it is not an ordinary question mark, searching for "?" with INDEX returns 0.
You must determine precisely under what standard the external character set is encoded, and place the map for that character set's encoding between it and DataStage.
You may also need to specify a different, though related, character map between DataStage server and client, so that UNIX-to-Windows character set mapping might occur. These secondary maps have names ending in "-CS".
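The behaviour described in this post can be imitated outside DataStage. The following is a minimal Python sketch, assuming the marker behaves like the Unicode replacement character U+FFFD (an assumption on my part; the sample bytes and variable names are also made up for illustration). It shows why a search for an ordinary "?" finds nothing, and that decoding with the correct map avoids the problem altogether.

```python
# Illustration only: Python, not DataStage BASIC.
REPLACEMENT = "\ufffd"  # U+FFFD REPLACEMENT CHARACTER, often displayed as "?"

# Bytes written in Latin-1 but wrongly asserted to be UTF-8.
raw = "café".encode("latin-1")

# The wrong map cannot convert the last byte, so a marker is substituted.
decoded = raw.decode("utf-8", errors="replace")

# Searching for an ordinary question mark fails (find returns -1 in Python,
# comparable to INDEX returning 0).
pos_question = decoded.find("?")

# Searching for the marker itself succeeds.
pos_marker = decoded.find(REPLACEMENT)

# Stripping the marker is possible, but the original character is lost.
stripped = decoded.replace(REPLACEMENT, "")

# Using the correct map keeps the data intact, which is the real fix.
correct = raw.decode("latin-1")

print(pos_question, pos_marker, stripped, correct)
```

Stripping the marker only hides the symptom; the character that could not be mapped is gone for good, so fixing the map is preferable to removing the marker.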