Page 1 of 1

Eleminating first row in SQF

Posted: Sun Aug 14, 2005 11:22 pm
by ssgrpid
Hi all,

One of the sources in my parallel job is a SQf that has a header. The job uses 4 nodes and before performing any operations on the SQF, I would like to remove the header and pass the remaining data to the stages.

I thought of using a UNIX script but the number of rows in SQF are very very high and thus performs poorly. Another option is to use a transformer. But I want to reduce the use a transformer in ||el jobs.

Can anyone suggest some good options for doing this.

Re: Eleminating first row in SQF

Posted: Sun Aug 14, 2005 11:54 pm
by tusharzade
In Sequential file stage you have a option where you can set the first the line as column header. So DS will send the remaining data to next stage. You do not need to use unix script or transformer.

Regadrs,
Tushar

Posted: Mon Aug 15, 2005 12:45 am
by ray.wurlod
Unless some very specific conditions apply, including that your sequential file is in fixed width format, it will not be processed on more than one node (that is, sequentially).

Therefore, whether you follow the Sequential File stage with a Filter stage, a Transformer stage or some other stage kind, if you force that stage to operate sequentially you can reliably ensure that there is but one first row.

Another possibility is to rely on the fact that the header row is unlikely to match the schema, so you could trap the header row in a reject link from the Sequential File stage.

Posted: Tue Aug 16, 2005 8:56 pm
by ssgrpid
Thanks a lot for the help guys. I was working on 7.1 and this doesnot have an option to remove first row. DS7.5 does have such option.