Search found 592 matches

by JoshGeorge
Thu Mar 06, 2008 10:33 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Print Filenames from Seq File using FilePattern
Replies: 6
Views: 6650

Set environment variable APT_IMPORT_PATTERN_USES_FILESET to TRUE and write your output to another Sequential file and see the result.
by JoshGeorge
Tue Mar 04, 2008 2:40 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: TeraData Enterprise Stage Problem
Replies: 6
Views: 4363

Try minimizing repartitioning by increasing the number of sessions.
by JoshGeorge
Sun Mar 02, 2008 12:10 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Writting parallel routine
Replies: 2
Views: 1518

I have a blog post: DataStage Parallel routines made really easy. All your queries are addressed in that blog.
by JoshGeorge
Thu Feb 28, 2008 9:46 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Parallel Subroutines
Replies: 2
Views: 1376

I have a blog post: DataStage Parallel routines made really easy. You might find it helpful.
by JoshGeorge
Tue Feb 26, 2008 4:09 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Teradata Enterprise Stage
Replies: 7
Views: 5275

When you change the option from append to truncate in the target stage aren't you are making it easy for datastage? By truncating the target table you are satisfying the main criteria for invoking FastLoad directly (Remeber: For Teradata fastload target table must be empty). If you can, post more de...
by JoshGeorge
Fri Feb 22, 2008 9:05 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: How to use TPump Utility in datastage
Replies: 1
Views: 2112

Have a look at this blog post Datastage 7x Enterprise Edition with Teradata, it has details about Teradata Utilities (including TPUMP) which can be used in DataStage.
by JoshGeorge
Thu Feb 21, 2008 5:10 pm
Forum: General
Topic: Dynamic Output File Names
Replies: 8
Views: 3214

For Job Type: Parallel, a parallel (C++) routine which creates files dynamically will be the best way to do this using a single job. Aggregate / Collect all the records related to each customer in a transformer using stage variables (Append new line character for each record) and use a remove duplic...
by JoshGeorge
Mon Feb 18, 2008 9:20 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Generation of Surrogate Key
Replies: 20
Views: 14431

For Kenneth: Good post. Again I'm not trying to debate on the best approach to take the seed. Undoubtedly, it is from the database. Approach I posted is on a complete tool based one. Thanks to you Kenneth! I have included your point and updated the blog post. This case you need to pick the max value...
by JoshGeorge
Mon Feb 18, 2008 7:12 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Generation of Surrogate Key
Replies: 20
Views: 14431

Thanks Kenneth! This post is a complete solution using the tool. Have emphasised my point on that and haven't touched the "database vs tool" topic either. Have given notes on how the design makes sure the latest value is stored and retrieved while several jobs try to generate surrogate key...
by JoshGeorge
Mon Feb 18, 2008 6:41 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: UserStatus warning
Replies: 4
Views: 1595

You can create a parallel routine and avoid using a Basic transformer. See if this POST helps. Call this parallel routine only once and read the value using a basic routine from sequence.
by JoshGeorge
Mon Feb 18, 2008 6:04 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Preserve the value of a particular field in transformer
Replies: 7
Views: 2333

Yes, using a parallel routine. Option 1: Set and get job / environment parameter. But not an optimised method for this requirement. Option 2: Write to a file from the first transformer and read the file in the next, again not an optimised method. Best for this requirement is the one noted above by R...
by JoshGeorge
Mon Feb 18, 2008 5:45 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Generation of Surrogate Key
Replies: 20
Views: 14431

I have a blog post on the same - Surrogate Key Generation in DataStage - An elegant way. If you know to use DataStage Job Control Interfaces specified in Advanced developer guide this will be really helpful.
by JoshGeorge
Fri Feb 15, 2008 3:43 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Combinine two or more identical rows
Replies: 6
Views: 2358

Use stage variables in transformer and adopt remove duplicate strategy to merge fields. Explore Sort stage with keychange as well.
by JoshGeorge
Fri Feb 15, 2008 3:25 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Datastage job scheduling on Windows environment
Replies: 4
Views: 3324

Re: Datastage job scheduling on Windows environment

If "watch for a file to kickoff the datastage job" is your requirement, then you don't have to think on a "scheduler" line. Wait for file stage in the sequence will do this for you. I have to watch for a file to kickoff the datastage job. Can some please let me know how to achiev...
by JoshGeorge
Thu Feb 14, 2008 4:43 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: DataStage Export
Replies: 3
Views: 1911

Version Control tool in DataStage Release 7x is worth looking into. There is a good post in FAQ section. Many uses this tool for backups as well.