I have a source table which resides in a remote DB. I get the records from this table based on the last_update_date column. Unfortunately, this table is not range-partitioned in the last_update_date column. My query is simple, no joins and so gets only from one table. As simple as:
Code: Select all
SELECT col1, col2, col3
FROM sourcetab
WHERE last_update_date >= "<cut-off-date>"
It takes a long time for my job to fetch the records from this remote source. We would like to tune it first at the DataStage level as much as possible, before going to the database level (such as using dblink, partitioning - ifever they may help).
I cannot think of any way to optimize this job. I tried using an intermediary container such as a temp file:
SOURCE --> temp file --> DW Staging
But this does not help..rather takes longer to finish. Any idea??