Need information on Data Extract approach

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
nvuradi
Participant
Posts: 54
Joined: Tue Dec 12, 2006 11:03 pm

Need information on Data Extract approach

Post by nvuradi »

Hi All,



We need to extract the data from OLTP system which is on Mainframe DB2 host

We have two approaches to extract the data from DB2 tables:

1. Using DB2 API Stage which directly fetches the data from DB2 tables.
2. Using JCL scripts to fetch the data from Mainframe and ftp the files to UNIX box. and process those files using a sequential stage.

can anyone suggest which approach is the best one taking into consideration the performance aspect. our jobs will be run daily insert/updating around million records.


Any help will be highly appreciated. :)

Thank you
Narendar
keshav0307
Premium Member
Premium Member
Posts: 783
Joined: Mon Jan 16, 2006 10:17 pm
Location: Sydney, Australia

Post by keshav0307 »

what is your target, where do you want to load the extracted data
nvuradi
Participant
Posts: 54
Joined: Tue Dec 12, 2006 11:03 pm

Post by nvuradi »

keshav0307 wrote:what is your target, where do you want to load the extracted data
data will be loaded to DB2UDB database.


Thanks,
Narendar
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

How many Mb are you looking at? If you use the API then the DataStage server controls when and how to do the job run. If you need to extract & ftp the data then another group is involved and controls if things will run. This would be push instead of pull technology.
Mallayagari
Participant
Posts: 15
Joined: Tue Apr 20, 2004 3:10 am

Re: Need information on Data Extract approach

Post by Mallayagari »

IF you have enough time to pull the records from mainframe using API stage then its better to use API stage. If time is critical factor to your processing then extract and ftp the data is the best method. How much time do you have to pull this data?

[quote="nvuradi"]Hi All,



We need to extract the data from OLTP system which is on Mainframe DB2 host

We have two approaches to extract the data from DB2 tables:

1. Using DB2 API Stage which directly fetches the data from DB2 tables.
2. Using JCL scripts to fetch the data from Mainframe and ftp the files to UNIX box. and process those files using a sequential stage.

can anyone suggest which approach is the best one taking into consideration the performance aspect. our jobs will be run daily insert/updating around million records.


Any help will be highly appreciated. :)

Thank you
Narendar[/quote]
Post Reply