DS questions

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
mydsworld
Participant
Posts: 321
Joined: Thu Sep 07, 2006 3:55 am

DS questions

Post by mydsworld »

Please let me know the following :

1. How can we perform a incremental data load in datastage
2. Can we restart a job sequence from a certain point on failure instead of running the whole sequence
I_Server_Whale
Premium Member
Premium Member
Posts: 1255
Joined: Wed Feb 02, 2005 11:54 am
Location: United States of America

Post by I_Server_Whale »

Search the forum. They have been covered before. Looks like they are some interview questions.
Anything that won't sell, I don't want to invent. Its sale is proof of utility, and utility is success.
Author: Thomas A. Edison 1847-1931, American Inventor, Entrepreneur, Founder of GE
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Short answers:

1) Yes, any kind of load actually.

2) Yes, and the methodology is documented in the manuals.

As noted, you'll find many, many (many) conversations on those topics here.
-craig

"You can never have too many knives" -- Logan Nine Fingers
michaeld
Premium Member
Premium Member
Posts: 88
Joined: Tue Apr 04, 2006 8:42 am
Location: Toronto, Canada

Post by michaeld »

>1. How can we perform a incremental data load in datastage

this is more of an ETL question and not a DS question. One method is to maintain a copy of the target table as a dataset and check for the difference with your input file every time the job starts. That will get you an incremental data set.

A lot of times it is faster to do this using the database so that you do not need to send a lot of data over the network. You can maintain incremental changes with triggers or do the difference of two table dumps (previous and current) on the database server. It depends on your system setup.

>2. Can we restart a job sequence from a certain point on failure instead of running the whole sequence

In your sequence job, you have to go to job properties and check the restartable option. This can also be set in Administrator as a project wide setting. A job will only restart from a checkpoint if it fails. Otherwise it will think that the run was successful. So you can either set it to "fail after 1 warning" in director or make it fail on a warning in the sequence job itselft.
Mike
Post Reply