Parallel loading with Referential Integrity

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
RaviReena
Premium Member
Premium Member
Posts: 68
Joined: Tue Jul 29, 2008 10:01 am

Parallel loading with Referential Integrity

Post by RaviReena »

I want to load/append the data to an empty table using ORACLE Enterprise stage in parallel.I have a referential integrity with other table for my target table and i want to reject/drop the rows which violates the referential Integrity.

what is the best design approach for this scenario of loading around 10million rows.


Any help is appericiated.
Rao V
prakashdasika
Premium Member
Premium Member
Posts: 72
Joined: Mon Jul 06, 2009 9:34 pm
Location: Sydney

Post by prakashdasika »

Join the dataset with the table with which you have the referential integrity and drop/reject the records and send only the ones that satisfy the constraints. You can use three stages:

1. Join - You can do a inner join and drop the records or do a left outer join and capture the records in the Transformer stage. Do not forget to sort the records on the keys

2. Merge - Similar to join, the data should be sorted and the links should be correctly choosen (MASTER AND UPDATE) and the rejected records can be directly captured in this stage itself.

3. Lookup - Define data stream and lookup stream properly and use the constraint to captue or drop the rejects. This can be a risky option due to volume of records.
Prakash Dasika
ETL Consultant
Sydney
Australia
Post Reply