Column Import Issue

pravin1581
Premium Member
Posts: 497
Joined: Sun Dec 17, 2006 11:52 pm
Location: Kolkata

Post by pravin1581 »

Hi All,

We are facing an issue with the Column Import stage. The input file has a single column containing 36 rows of pipe-delimited data.
The Column Import stage splits that column into 2 output columns on the pipe delimiter. The output file should contain 36 rows, but only 9 rows are populated. We are using a 2-node config file. Changing the partition type to Entire in the Column Import stage raises the row count to 18, but the records are duplicated: the same 9 records appear twice.

Thanks in advance.
Minhajuddin
Participant
Posts: 467
Joined: Tue Mar 20, 2007 6:36 am
Location: Chennai

Post by Minhajuddin »

The Column Import stage doesn't need any keyed partitioning because it doesn't work on any keys. So you don't need "Entire" partitioning either. And if Entire partitioning is used, you'll get each record repeated as many times as there are nodes in your config file.

Now, about your problem: you said you want 36 rows in the output. How many rows do you have in the input? The Column Import stage is not supposed to change the number of rows.
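
To illustrate the duplication described above, here is a minimal Python sketch that simulates round-robin versus Entire partitioning on a 2-node setup. It is not DataStage code; the function names (`round_robin`, `entire`, `column_import`) and the sample rows are made up for illustration, and the real engine's behaviour may differ in detail.

```python
def round_robin(rows, nodes):
    """Deal rows out one at a time, so each row lands on exactly one node."""
    parts = [[] for _ in range(nodes)]
    for i, row in enumerate(rows):
        parts[i % nodes].append(row)
    return parts


def entire(rows, nodes):
    """Give every node a full copy of the input (assumed model of Entire)."""
    return [list(rows) for _ in range(nodes)]


def column_import(partitions, delimiter="|"):
    """Split each single-column row into fields, partition by partition."""
    return [row.split(delimiter) for part in partitions for row in part]


# Hypothetical sample data: 9 pipe-delimited rows, 2 nodes.
rows = [f"key{i}|value{i}" for i in range(9)]

rr_out = column_import(round_robin(rows, 2))
entire_out = column_import(entire(rows, 2))

print(len(rr_out), len(entire_out))
```

In this model, Entire partitioning only multiplies the rows that already reach the stage, which matches the 9 to 18 jump reported above. It does not explain why only 9 of the 36 input rows come through in the first place.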
Minhajuddin

pravin1581
Premium Member
Posts: 497
Joined: Sun Dec 17, 2006 11:52 pm
Location: Kolkata

Post by pravin1581 »

Thanks for the reply. There are 36 rows in the input.
pravin1581
Premium Member
Posts: 497
Joined: Sun Dec 17, 2006 11:52 pm
Location: Kolkata

Post by pravin1581 »

Any suggestions?
OddJob
Participant
Posts: 163
Joined: Tue Feb 28, 2006 5:00 am
Location: Sheffield, UK

Post by OddJob »

Have you checked the Director log to see whether any input records are being rejected? Try adding a reject link to the Column Import stage and forcing rejected records down that link (to a Peek stage) by setting 'Reject Mode' to 'Output'.
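
As a quick cross-check outside DataStage, a small Python sketch like the one below can show which input lines would fail a simple two-field split. The function name `import_columns`, the sample rows, and the rejection rule (anything that does not split into exactly two fields, including a missing value) are assumptions for illustration; the actual stage's reject criteria depend on its schema and format settings.

```python
def import_columns(rows, expected_fields=2, delimiter="|"):
    """Split rows on the delimiter; rows with the wrong field count are rejected."""
    accepted, rejected = [], []
    for row in rows:
        # Treat a missing (None) value as having no fields at all.
        fields = row.split(delimiter) if row is not None else []
        if len(fields) == expected_fields:
            accepted.append(fields)
        else:
            rejected.append(row)
    return accepted, rejected


# Hypothetical sample: two good rows and three that would be rejected.
sample = ["a|1", "b|2", "no delimiter here", "c|3|extra", None]
accepted, rejected = import_columns(sample)

print(len(accepted), "accepted;", len(rejected), "rejected")
for row in rejected:
    print("rejected:", repr(row))
```

Running the same kind of check over the real 36-row file would show whether 27 of the lines are malformed, which is one possible reason only 9 rows reach the output.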
Minhajuddin
Participant
Posts: 467
Joined: Tue Mar 20, 2007 6:36 am
Location: Chennai

Post by Minhajuddin »

Do any of the rows have a NULL value in the column you are splitting?
Minhajuddin
