We've a job as shown below
Code: Select all
Hashed File
|
DRS (Source) -> IPC -> Row merger -> Trans1 -> Trans2 -> Row splitter -> IPC -> Trans3 -> DRS (Target)
The DRS (source) contains roughly 9 million records and contains a select statement.
The reference Hashed file too contains roughly 9 million records. The hashed file details below -
Type 30 (64-bit)
1296359424 Jan 11 16:00 DATA.30
374102016 Jan 11 15:52 OVER.30
This job runs every night and the problem is it processes 10 rows/second and takes 1 hour 45 minutes to complete.
What I could analyze is, the extraction from source and lookup on Hashed file stage take a long time. Are there ways to improve the run time of this job by improving the lookup speed?
Please let me know if you require any more details.
Thanks.