CDC Operator

Moderators: chulett, rschirm, roy

nvalia
Premium Member
Posts: 180
Joined: Thu May 26, 2005 6:44 am

Post by nvalia »

Hi,

Which operator does the CDC (Change Capture) stage use?
Since all of the operators are written in C++, would it be correct to say that they internally use built-in C++ functions or algorithms to compare the two datasets?

Regards,
NV
DSguru2B
Charter Member
Posts: 6854
Joined: Wed Feb 09, 2005 3:44 pm
Location: Houston, TX

Post by DSguru2B »

More like C. Don't forget the osh engine. So I would say C and osh.
Creativity is allowing yourself to make mistakes. Art is knowing which ones to keep.
nvalia
Premium Member
Posts: 180
Joined: Thu May 26, 2005 6:44 am

Post by nvalia »

Kumar, thanks for your prompt response.

Two questions:
1. Is creating a Seq file faster than creating a Dataset from a DB table?

2. Would the process of creating datasets from an RDBMS be faster than the process of comparing them using the CDC stage?
DSguru2B
Charter Member
Posts: 6854
Joined: Wed Feb 09, 2005 3:44 pm
Location: Houston, TX

Post by DSguru2B »

I am not Kumar.
Why don't you run the benchmarks and tell us which one is better or faster, and why?
I did not get your second point. What does the creation of a dataset have to do with CDC's performance?
Creativity is allowing yourself to make mistakes. Art is knowing which ones to keep.
nvalia
Premium Member
Posts: 180
Joined: Thu May 26, 2005 6:44 am

Post by nvalia »

Thanks DSGuru2B.

I am at a client location and currently do not have access to DataStage.
Of course, I will perform these benchmark tests as soon as the setup is ready.

The only reason I ask is that if someone has already done a similar analysis, it can be leveraged by others.

My second question meant this: if creating the two datasets for use with CDC takes 5 minutes, should the Change Capture process on those two datasets take more or less than 5 minutes?
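
For readers following the thread, here is a rough Python sketch of what a change-capture comparison between a "before" and an "after" dataset does conceptually. It is not the engine's implementation (the thread only says the operators are compiled code run by osh). The function name `change_capture`, the `keep_copy` flag, and the sample rows are made up for illustration. The numeric change codes follow the stage's commonly documented defaults (0 copy, 1 insert, 2 delete, 3 edit), so treat them as an assumption and check your own job's settings.

```python
# Conceptual sketch only: names and data are hypothetical, not DataStage APIs.
# Change codes assumed from the stage's commonly documented defaults.
COPY, INSERT, DELETE, EDIT = 0, 1, 2, 3


def change_capture(before, after, key, values, keep_copy=False):
    """Compare two lists of row dicts by key and return (change_code, row) pairs."""
    before_by_key = {row[key]: row for row in before}
    after_by_key = {row[key]: row for row in after}

    changes = []
    # Walk every key seen in either dataset, in key order.
    for k in sorted(before_by_key.keys() | after_by_key.keys()):
        old = before_by_key.get(k)
        new = after_by_key.get(k)
        if old is None:
            changes.append((INSERT, new))   # key only in "after"
        elif new is None:
            changes.append((DELETE, old))   # key only in "before"
        elif any(old[col] != new[col] for col in values):
            changes.append((EDIT, new))     # same key, value columns differ
        elif keep_copy:
            changes.append((COPY, new))     # unchanged row, kept only on request
    return changes


before = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}, {"id": 3, "name": "c"}]
after = [{"id": 1, "name": "a"}, {"id": 2, "name": "B"}, {"id": 4, "name": "d"}]

result = change_capture(before, after, key="id", values=["name"])
for code, row in result:
    print(code, row)
```

Each row is looked at once per dataset, so the comparison cost grows with the row count. That says nothing about how it compares with extraction time in a real job, which depends on the database, network, partitioning, and sorting, so benchmarking as suggested above is still the way to answer the timing question.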