Page 1 of 1

Reg Schema file

Posted: Tue Apr 13, 2010 4:53 am
by nani0907
Hi ,

for a data load ,i get two files source file with different format . how can i make my job generic to read the source files with has format diffrence .can some one help out to acheive this by schema file option.

Thanks in Advance

Posted: Tue Apr 13, 2010 5:23 am
by ray.wurlod
Depends.

What do you want to DO with the data records once you've read them?

Re: Reg Schema file

Posted: Tue Apr 13, 2010 6:32 am
by AVISSIVA
nani0907 wrote:Hi ,

for a data load ,i get two files source file with different format . how can i make my job generic to read the source files with has format diffrence .can some one help out to acheive this by schema file option.

Thanks in Advance

If you have file Identifier then you can make generic. Based on the file identifier you can create two data flows.

Posted: Tue Apr 13, 2010 7:33 am
by ShaneMuir
As Ray suggests, reading 2 different file types is quite easy with schema files, what is possibly more important is how you intend to use the data. Ie do you want to load into a database, a dataset, or another file. What is the format of the target - is this also 2 different schema?

Without answers to questions like these it is very difficult to suggest a suitable course of action.

Posted: Mon Apr 19, 2010 11:12 pm
by nani0907
once after reading the different delimilter files , the idea is to write to dataset and from there to do tranformations accordingly.

Posted: Wed Apr 21, 2010 8:14 am
by ShaneMuir
Is the dataset target to be in a common format? Or will the datasets be different for each file?

Is it possible to load them to a current format?

Posted: Thu Apr 22, 2010 4:18 am
by nani0907
i t will be dufferent format based on invocation id or in some cases may be same

Posted: Thu Apr 22, 2010 10:20 am
by ShaneMuir
So you will have to create a schema for each file that you wish to read.

You can apply this schema on the sequential file stage with RCP on and load the data directly into a dataset. Keep in mind that you will have to parameterise file and dataset names, as well as the schema names, and call these accordingly from your sequence.