Hi All,
I have a job with two remove duplicates , a lookup stage and two transformers. My source and target are database stages
I have to run job with around 30 million records. Length of a record is 50
Can anyone suggest how much space do I need to have?
how do i calculate the disk space needed?
Thanks
Disk space for a job
Moderators: chulett, rschirm, roy
If the data is not sorted, you may need to include even a sort stage.
It again depends on the number of duplicated the input contain, and the data of the lookup (either its a sparse or Lookup fileset).
If it is a lookupfileset, and if you have less records per group (duplicates) and with simple transformation logic, you will have very less usage of disk.
It again depends on the number of duplicated the input contain, and the data of the lookup (either its a sparse or Lookup fileset).
If it is a lookupfileset, and if you have less records per group (duplicates) and with simple transformation logic, you will have very less usage of disk.
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
-
- Participant
- Posts: 437
- Joined: Fri Oct 15, 2004 6:13 am
- Location: Pune, India
As data is going to target, it depends on record size in target and no. of records. So approx (record size * total records).
If you are asking about the scratch space required then is would be depndent on the physical memory available for the job in addition to what Kumar_s has mentioned in his post.
If you are asking about the scratch space required then is would be depndent on the physical memory available for the job in addition to what Kumar_s has mentioned in his post.
Regards,
S. Kirtikumar.
S. Kirtikumar.