Filter in sequential file.

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Christina
Participant
Posts: 34
Joined: Wed Sep 06, 2006 11:50 am

Post by Christina »

My source file contains 24 months of data. I have to extract the first 12 months of data from that file using the filter command in a sequential file. Which command do I have to use to achieve this requirement?
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

What makes you think you have to use the filter command to achieve this? :?

That sounds more like a job for a Transformer constraint. How will you recognize these 'first 12 months' of data? What is the business rule you are attempting to implement? The answer to that question will help determine the appropriate place to do the dirty deed.
-craig

"You can never have too many knives" -- Logan Nine Fingers
Christina
Participant
Posts: 34
Joined: Wed Sep 06, 2006 11:50 am

hi

Post by Christina »

We already have a job for this with Transformer constraints. Since the data volume is high, it is taking a long time, so we are looking for ways to improve performance. Is there any other way to achieve this besides Transformer constraints (one that will reduce the time taken)?

Thanks in advance.
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

chulett wrote: How will you recognize these 'first 12 months' of data? What is the business rule you are attempting to implement? The answer to that question will help determine the appropriate place to do the dirty deed.
:?:

ps. Unless your constraints are horribly complex, I don't see how they could be your 'taking more time' culprit. We'd have to have a better idea of your job design in order to provide any specific help there.
-craig

"You can never have too many knives" -- Logan Nine Fingers
Christina
Participant
Posts: 34
Joined: Wed Sep 06, 2006 11:50 am

Post by Christina »

The constraints are not that complex. We pass in the first month value, compare it with the date value in the source file, and find the month difference. Based on this month difference, we split the 24 months of data into 2 files of 12 months each. We have nearly 100 million records, and it takes 4 to 5 hours to load these files.
Can we reduce this time?
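
For reference, the month-difference rule described above could be sketched roughly as follows. This is only an illustration in Python, not the actual Transformer constraint from the job; the function names and the output labels are hypothetical, and it assumes the month difference is counted in whole calendar months, ignoring the day of the month.

```python
from datetime import date


def months_between(first_month: date, record_date: date) -> int:
    """Whole calendar months from first_month to record_date (day of month is ignored)."""
    return (record_date.year - first_month.year) * 12 + (record_date.month - first_month.month)


def target_file(first_month: date, record_date: date) -> str:
    """Pick one of two hypothetical outputs based on the month difference."""
    diff = months_between(first_month, record_date)
    if 0 <= diff < 12:
        return "first_12_months"
    if 12 <= diff < 24:
        return "second_12_months"
    # Anything outside the 24-month window is flagged rather than silently dropped.
    return "out_of_range"
```

A check this cheap per row is unlikely to account for hours of run time on its own, so the job design around it is the more likely place to look.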
DSguru2B
Charter Member
Posts: 6854
Joined: Wed Feb 09, 2005 3:44 pm
Location: Houston, TX

Post by DSguru2B »

Make your job multi-instance. Have each instance read a subset of the data and, depending on the resources available, run as many instances simultaneously as you can.
A much faster approach: bulk load the file into a work table and write a SQL query to extract the data. Again, two sets of 12 months each.
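
As a rough sketch of the work-table idea, the split can be expressed as two date-range queries. The example below uses Python's built-in sqlite3 module purely as a stand-in, since the thread does not say which database is available; the table name `work`, its columns, and the function name are all hypothetical, and dates are assumed to be stored as ISO `YYYY-MM-DD` text. A real bulk load would go through the database's own loader rather than row inserts.

```python
import sqlite3


def split_by_month(rows, first_month):
    """Split rows into two 12-month sets using SQL range queries.

    rows: iterable of (record_id, 'YYYY-MM-DD') tuples (hypothetical layout).
    first_month: first day of the starting month, e.g. '2005-01-01'.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE work (record_id INTEGER, record_date TEXT)")
    conn.executemany("INSERT INTO work VALUES (?, ?)", rows)

    # Half-open range [start, end): ISO date strings compare correctly as text.
    query = """
        SELECT record_id FROM work
        WHERE record_date >= date(?, ?) AND record_date < date(?, ?)
        ORDER BY record_id
    """
    first = [r[0] for r in conn.execute(
        query, (first_month, "+0 months", first_month, "+12 months"))]
    second = [r[0] for r in conn.execute(
        query, (first_month, "+12 months", first_month, "+24 months"))]
    conn.close()
    return first, second
```

With an index on the date column, each of the two extracts becomes a simple range scan, which is the main reason this approach can outrun row-by-row evaluation.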
Creativity is allowing yourself to make mistakes. Art is knowing which ones to keep.
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

In other words - it's not the constraints. Seek your solution elsewhere. You've already gotten a couple of good suggestions; let us know if you need any help with them.
-craig

"You can never have too many knives" -- Logan Nine Fingers