How to generate row numbers for each file-Job Design

sureshreddy2009 · Post by **sureshreddy2009** » Thu Jun 03, 2010 1:29 am

Hi

I have files in my inbound directory as
whs_e1_20100502.txt,
whs_e2_20100502.txt,
whs_e3_20100502.txt. dates are changing for every day load.

Now I have to generate the row numbers for each and every file. For example if first file contains 100 records, numbers will be generated in a count column as 1,2,3...100 for all records. If second file contains 30 records then count column will have numbers as 1,2,3...30.
some day multi batch scenario is also there like I am running 3 days batch at a time. For this example there are files for so many days with names whs_e1*.txt , in the place of * dates are coming.

Please tell me the job design to implement this... Thanks in advance for your solutions.

zulfi123786 · Post by **zulfi123786** » Thu Jun 03, 2010 3:50 am

If you are having specific count of files to read then you can use a transformer and create a column with deriavation as @OUTROWNUM system variable instead if you are not sure of how many files you would be reading each time then you can specify the multiple files to read option in this case you will not have the numbers restarted each time for different files.

In such case you can have a small unix script to do the same where you can generate numbers for each file using the cat -n option

sureshreddy2009 · Post by **sureshreddy2009** » Thu Jun 03, 2010 5:01 am

Hi,

I got the answer for my question/scenario
Here is the solution.
I read all the files through sequential file stage by putting file pattern , I put another option file name column. after sequential file stage I used sort stage and is sorted based on file name column including hash paritioning on file name column. after that in transformer stage variables based on file name column by using previous key and after key logic. i generated numbers for each and every file starting from number 1 incrementing 1 by 1 up to completion of all records in that file. and atlast i loaded in the target table.

ray.wurlod · Post by **ray.wurlod** » Thu Jun 03, 2010 5:50 am

You can't do that in a server job.
(Moved, as requested)

chulett · Post by **chulett** » Thu Jun 03, 2010 5:55 am

While you may not be able to do that, you can certainly do that in a Server job.

sureshreddy2009 · Post by **sureshreddy2009** » Thu Jun 03, 2010 6:04 am

If your post is explaining about this logic will be implement only in server jobs not in paralell jobs.

I can say one thing that I implemented in parallel jobs 8.0.1
I tested the scenario also.

chulett · Post by **chulett** » Thu Jun 03, 2010 6:08 am

No, the comment from Ray is directed at your choice of forum to post in.

sureshreddy2009 · Post by **sureshreddy2009** » Thu Jun 03, 2010 6:12 am

But my problem is related to parallel jobs that is why I posted in parallel jobs forum.
Any how I got it thanks

chulett · Post by **chulett** » Thu Jun 03, 2010 6:21 am

Double-check where you posted, where we are right now. We are not in the parallel jobs forum.

sureshreddy2009 · Post by **sureshreddy2009** » Thu Jun 03, 2010 6:27 am

Next time I will take care the forum selection

.. Thanks a ton

DSXchange

How to generate row numbers for each file-Job Design

How to generate row numbers for each file-Job Design

Now Got the answer