Sequential File - Logic required

DSDexter · Post by **DSDexter** » Thu Jun 19, 2008 6:14 am

Hi Gurus,

I have a flat file with some records and trailer records which will contain the records count. Below this count column I am getting a blank line and a EOF char at the next line as shown below

Code: Select all


col1,col2,col3\n
100\n
................................................................................\n
EOF

(I am including \n for better understanding only).

How can i process lines only above the record count. Just to add to it. The no. of blank lines are not consistent

Can I get rid of all the unwanted lines using filter property of Seq. Stage?

Any help will be appreciated.

ArndW · Post by **ArndW** » Thu Jun 19, 2008 6:20 am

If you have multiple columns in real data records and only one column (no commas) in unwanted records you can use the reject row facility in the flat file stage to discard those.

DSDexter · Post by **DSDexter** » Thu Jun 19, 2008 6:46 am

Andrw,

But in that case it will throw a warning in the log, saying import unsucessfull. And I dont want that to happen.
Also on the reject link I have a transformer which will abort the job if a single reject is encountered (Its not evil, It's the requirement)

.

So I have to avoid above two scenarios.

ArndW · Post by **ArndW** » Thu Jun 19, 2008 6:56 am

You can use an external filter and implement your choice of awk/sed or other program call that will only copy the part of the file you want or declare the file with just one big string record, run it through a transform to use substring commands to filter out the unwanted records, then use a column export stage to create your columns.

DSDexter · Post by **DSDexter** » Thu Jun 19, 2008 8:20 am

Andrw,

I am using the following approach

1. Remove all the blank lines using

Code: Select all

sed -e "/^[ ]*$/d"

2. Get the line count now using wc -l
3. head (above result -1) records.

Can I use multiple filters, pipe seperated in external filter stage?

ArndW · Post by **ArndW** » Thu Jun 19, 2008 8:50 am

Yes, you can use multiple filters. You might encounter runtime errors (I can't recall what they were) when using multiple pipes, but that can be solved by making the command "sh" and the arguments "'{your commands}'"