how to handle duplicate records in sequential file

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
ds_dwh
Participant
Posts: 39
Joined: Fri May 14, 2010 6:06 am

how to handle duplicate records in sequential file

Post by ds_dwh »

hi,

how to handle duplicate records in sequential file

either deleting or sending to rej link
ANJI
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

There are multiple ways to identify duplicate records regardless of source. What have you tried?
-craig

"You can never have too many knives" -- Logan Nine Fingers
nagarjuna
Premium Member
Premium Member
Posts: 533
Joined: Fri Jun 27, 2008 9:11 pm
Location: Chicago

Post by nagarjuna »

duplicates stage , sort stage , transformer stage ......
Nag
ds_dwh
Participant
Posts: 39
Joined: Fri May 14, 2010 6:06 am

with using sequential file only

Post by ds_dwh »

nagarjuna wrote:duplicates stage , sort stage , transformer stage ......
with using sequential file only

not using sort,remove duplicate,T/F
ANJI
Sainath.Srinivasan
Participant
Posts: 3337
Joined: Mon Jan 17, 2005 4:49 am
Location: United Kingdom

Post by Sainath.Srinivasan »

Is this an interview question ?

Maybe you can edit the file and manually remove them !!
kondeti
Premium Member
Premium Member
Posts: 67
Joined: Sat Mar 04, 2006 11:38 am

Post by kondeti »

There is an option available in Sequential File Stage called Filter. This option directly deals with Operating System where your DataStage server is Installed either Unix Flavour or Windows. Write respective script(either dos/Unix based on your Operating system) in the Filter option and ristrict duplicate records. Thank you.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Re: with using sequential file only

Post by chulett »

ds_dwh wrote:
nagarjuna wrote:duplicates stage , sort stage , transformer stage ......
with using sequential file only
Why? :?
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Use Filter command in Sequential File stage. The command sort -u #filepath# might be a good choice.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply