Search found 5 matches

by biju.chiramel
Thu Dec 17, 2009 1:44 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: How to remove duplicates
Replies: 6
Views: 3063

May be


We can have link collector with 3 input links for the sequential files with same meta data.... then sort stage on AcctID.... then aggregator stage with group by on AcctID and "first" function on other fields...

Thanks
by biju.chiramel
Thu Dec 17, 2009 12:30 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Error compiling Job Control Subroutine <JobName>
Replies: 7
Views: 11660

May be its because in original project you have
WORKDIR
SCRIPTDIR with default values set...

and in current project nothing set for the variables.

Please check in job properties and environment variables...
by biju.chiramel
Wed Dec 16, 2009 6:18 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: multiple update statements
Replies: 6
Views: 3249

Re: multiple update statements

As per what I understood... suppose A, B are the key columns C, D, E are to be updated and F, G, H not to be updated then pass F, G, H as NULL values... and use only one statement like update TABLE SET C = NVL(:3, C), D = NVL(:4, D), E = NVL(:5, E), F = NVL(:6, F), G = NVL(:7, G), H = NVL(:8, H) WHE...
by biju.chiramel
Wed Dec 16, 2009 6:04 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: output value from a job as input of a user variable activity
Replies: 2
Views: 1339

Re: output value from a job as input of a user variable acti

According to me... Option 1. store the timestamp of run in a hashfile. pull all the data from source. filter with hash value and propagate only new records. Option 2. Store timestamp in hashfile create routine to read the hashfile and output the value. call this routine in Routine Activity. Pass the...
by biju.chiramel
Wed Dec 09, 2009 12:15 am
Forum: General
Topic: Making a job to run on Multiple Instances
Replies: 4
Views: 2011

Re: Making a job to run on Multiple Instances

Hi,

According to me.... the name should be unique for calling the job. So usually <Parent Job>.<Child Job> should be the name format.

Select the stages which are suitable for parallel run and requirement