Data set descriptor location
Posted: Fri Jan 16, 2009 12:59 am
Hi there,
I have a question regarding the set-up of data sets on a project. A data set has a descriptor and one or more data files (the actual number depending on how many nodes/partitioning is specified).
Now these data set data files will be stored by the Parallel Engine on the resource disk e.g. /disk1/Ascential/DataStage/DataSets but the location of the data set descriptor is determined by the path name specified in the Data Set stage in each job.
Can I just confirm what the best practice is (if one exists) about where the data set descriptors should be located - should they be in the same area as the data files i.e in the resource disk directory, or should they be located in a completely separate directory independent of the configuration file area?
Many thanks,
James
I have a question regarding the set-up of data sets on a project. A data set has a descriptor and one or more data files (the actual number depending on how many nodes/partitioning is specified).
Now these data set data files will be stored by the Parallel Engine on the resource disk e.g. /disk1/Ascential/DataStage/DataSets but the location of the data set descriptor is determined by the path name specified in the Data Set stage in each job.
Can I just confirm what the best practice is (if one exists) about where the data set descriptors should be located - should they be in the same area as the data files i.e in the resource disk directory, or should they be located in a completely separate directory independent of the configuration file area?
Many thanks,
James