DSXchange

JoshGeorge

Set environment variable APT_IMPORT_PATTERN_USES_FILESET to TRUE and write your output to another Sequential file and see the result.

JoshGeorge

Try minimizing repartitioning by increasing the number of sessions.

JoshGeorge

I have a blog post: DataStage Parallel routines made really easy. All your queries are addressed in that blog.

JoshGeorge

I have a blog post: DataStage Parallel routines made really easy. You might find it helpful.

JoshGeorge

When you change the option from append to truncate in the target stage aren't you are making it easy for datastage? By truncating the target table you are satisfying the main criteria for invoking FastLoad directly (Remeber: For Teradata fastload target table must be empty). If you can, post more de...

JoshGeorge

Have a look at this blog post Datastage 7x Enterprise Edition with Teradata, it has details about Teradata Utilities (including TPUMP) which can be used in DataStage.

JoshGeorge

For Job Type: Parallel, a parallel (C++) routine which creates files dynamically will be the best way to do this using a single job. Aggregate / Collect all the records related to each customer in a transformer using stage variables (Append new line character for each record) and use a remove duplic...

JoshGeorge

For Kenneth: Good post. Again I'm not trying to debate on the best approach to take the seed. Undoubtedly, it is from the database. Approach I posted is on a complete tool based one. Thanks to you Kenneth! I have included your point and updated the blog post. This case you need to pick the max value...

JoshGeorge

Thanks Kenneth! This post is a complete solution using the tool. Have emphasised my point on that and haven't touched the "database vs tool" topic either. Have given notes on how the design makes sure the latest value is stored and retrieved while several jobs try to generate surrogate key...

JoshGeorge

You can create a parallel routine and avoid using a Basic transformer. See if this POST helps. Call this parallel routine only once and read the value using a basic routine from sequence.

JoshGeorge

Yes, using a parallel routine. Option 1: Set and get job / environment parameter. But not an optimised method for this requirement. Option 2: Write to a file from the first transformer and read the file in the next, again not an optimised method. Best for this requirement is the one noted above by R...

JoshGeorge

I have a blog post on the same - Surrogate Key Generation in DataStage - An elegant way. If you know to use DataStage Job Control Interfaces specified in Advanced developer guide this will be really helpful.

JoshGeorge

Use stage variables in transformer and adopt remove duplicate strategy to merge fields. Explore Sort stage with keychange as well.

JoshGeorge

If "watch for a file to kickoff the datastage job" is your requirement, then you don't have to think on a "scheduler" line. Wait for file stage in the sequence will do this for you. I have to watch for a file to kickoff the datastage job. Can some please let me know how to achiev...

JoshGeorge

Version Control tool in DataStage Release 7x is worth looking into. There is a good post in FAQ section. Many uses this tool for backups as well.

DSXchange

Search found 592 matches

Re: Datastage job scheduling on Windows environment