National characters converted data from Baltic/ Poland

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

Hi,

any input are much appreciated!!!

Br,
Devesh
Atangell
Participant
Posts: 1
Joined: Sun Apr 17, 2011 11:51 am
Location: India

Post by Atangell »

Hi Devesh,

Are you reading from a file or a database?
It is important to know the nls-settings from the source team, used to produce the source data so that it can be read correctly.

If your are reading from a database and using an odbc stage - the dsn in odbc.ini needs to be modified to have the correct codepage - for eg/- if source data is written in ms1252 - add the codepage - 2252 to read it.
Cheers
Ashwin!
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

Hi,

File is on mainframe server. I am ftping it to UNIX server by simple get command.
The job loads a simple Teradata stage table.
The NLS setting on source system level is set to NLS IBM1112 and on ETL ( Datastage and TD) server is ISO-8859-1 i.e UTF 8.
When i tried to change NLS ( from ISO-8859-1 to IBM1112 ) on job level it aborts by saying import has failed.

This is issue is still open. any more input is highly appreciated.

Thanks and regards,
Devesh singh
priyadarshikunal
Premium Member
Premium Member
Posts: 1735
Joined: Thu Mar 01, 2007 5:44 am
Location: Troy, MI

Post by priyadarshikunal »

which codepage you tried for this? are you getting the same value when you write it to a file or just in database?
Priyadarshi Kunal

Genius may have its limitations, but stupidity is not thus handicapped. :wink:
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

I have tried with IBM1112 Estonian char set.
is this somthing wrong with DS NLS setting. if we changed code page it should read it properly.
asorrell
Posts: 1707
Joined: Fri Apr 04, 2003 2:00 pm
Location: Colleyville, Texas

Post by asorrell »

When you get it via FTP, is the transfer mode set to binary or text mode? It should be binary mode to prevent any changes.
Andy Sorrell
Certified DataStage Consultant
IBM Analytics Champion 2009 - 2020
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

It's been three months now... have you not engaged your official support provider to help with this?
-craig

"You can never have too many knives" -- Logan Nine Fingers
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

yes and solotion reached till unix, DS loading is still issue.

quick input is much appreciated!!!
:shock: :?
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

You've confirmed that the data is stored in UNIX in 8859-4. So you need to tell DataStage to use this codepage when reading the source file. After that, any character set conversions you do will work, since the data has been correctly read in.
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

Hi Arndw,

i applied the NLS setting with same code page but same is not appearing while loading in table.

Is this something DS is not set to support that code page.
i have checked all the installed code page by iconv command and can see same 8859-4 in the list.
not sure how to proceed further.

Any more input?

Br,
Devesh
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

one more observation.

TD connector is being used to load the data into target table.

the settting for character setting is done as below.

Client-character set -- UTF8

server character set --Unicode.

do we have option to include NLS setting to 8859-4 in connector property?

is it converting character while loading the file?
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

There are several possibilities where these characters are being interpreted incorrectly.

1. FTP from Mainframe to server.
2. sequential read into DataStage
3. Output from DataStage to Teradata

Doing a "view data", especially from within DataStage, is not going to help, as it introduces yet another source of errors.

You need to identify the row and column with the error, then make a copy of your job which uses the SEQ() function to give you the numeric value of the single character in question. First from UNX on the source file after FTP. If that one character is correct there, then run the DataStage job and do a peek of that value, if it is correct in the DataStage job then the error source is in the DataStage-Teradata interface.
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

content is hidden for paid user.

looks like you point to setting on TD.
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

I have removed the TD connector with seq file stage and run the job and can see the desried output. so it issue with TD connector setting.

next i have tried to set NLS setting ISO_8859_4:1988 in TD but non of character are getting inserted in Table. :?
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Post by devesh_ssingh »

I have checked the code support page on terdata client and found below are installed

Teradata character set Language IBM DB2 code set
Latin1_0A" English (Latin 1) ISO8859-1
Latin1252_0A" English (Win Latin) ISO8859-1/15
LATIN9_0A not sure
LATIN1252_3A0 not sure
LATIN1252_0A

there is no ISO8859-4 support on TD client ? can this be issue?
i am just guessing..
Post Reply