Hi,
I have a requirement that i want to join 2 data files having multiple keys a primary key.
I have read posts suggesting make one file as HASH and then joining with other file but as per documentation HASH file can have only 1 column as key. So how would i achieve this?
Secondly, Whats the best practice in DS of joining multiple tables as i am using TD and the Stage for fecthing large volume is not that efficient.
What if the size of the table is quite huge then whats the best strategy of joining these kind of tables.
Thanks in advance.
Multiple Keys join of Files
Moderators: chulett, rschirm, roy
Multiple Keys join of Files
Regards,
Affan
"Questioning is Half Knowledge"
Affan
"Questioning is Half Knowledge"
Re: Multiple Keys join of Files
Not sure where you read that, but it is incorrect... hashed file stages support multiple key columns.maffan76 wrote:I have read posts suggesting make one file as HASH and then joining with other file but as per documentation HASH file can have only 1 column as key.
-craig
"You can never have too many knives" -- Logan Nine Fingers
"You can never have too many knives" -- Logan Nine Fingers
Thanks for the clarification,
Now coming to the second part of the question is that if have to join source table data coming in a file with some target table in database, whats the optimal way of doing it, i mean one ways is to make a hash file and then join those or is there anyother way of doing it?
eventually have to generate a surrogate key of for the new records and assign exisitng generated key to the existing record while joining the 2 data sets with multiple keys as joining criteria.
the size of the existing table is quite big and will continue to grow so how do we handle this kind of situation in DS because exporting the data on every run of the job to HASH file will be time consuming.
Now coming to the second part of the question is that if have to join source table data coming in a file with some target table in database, whats the optimal way of doing it, i mean one ways is to make a hash file and then join those or is there anyother way of doing it?
eventually have to generate a surrogate key of for the new records and assign exisitng generated key to the existing record while joining the 2 data sets with multiple keys as joining criteria.
the size of the existing table is quite big and will continue to grow so how do we handle this kind of situation in DS because exporting the data on every run of the job to HASH file will be time consuming.
Regards,
Affan
"Questioning is Half Knowledge"
Affan
"Questioning is Half Knowledge"