Hi,
I need templates or other inputs on how to create a data profiling sheet (process).
Having worked only in ETL so far, I don't have much idea of how to document a profiling process.
I would appreciate any help on this.
Thanks in advance,
Regards,
Santosh.
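For anyone landing on this thread later: one way to start a profiling sheet is to record a few basic metrics per column (row count, nulls, distinct values, min, max, most common value). Below is a minimal sketch in plain Python, not an Information Analyzer feature. The sample data and the `profile_columns` helper are made up for illustration, and values are compared as strings because they come straight from CSV.

```python
import csv
import io
from collections import Counter


def profile_columns(rows):
    """Return one profile record per column from a list of dict rows."""
    if not rows:
        return []
    profile = []
    for column in rows[0].keys():
        values = [row[column] for row in rows]
        # Treat None and empty strings as missing values.
        present = [v for v in values if v not in (None, "")]
        counts = Counter(present)
        profile.append({
            "column": column,
            "row_count": len(values),
            "null_count": len(values) - len(present),
            "distinct_count": len(counts),
            # min/max are string comparisons here, since CSV values are text.
            "min": min(present) if present else None,
            "max": max(present) if present else None,
            "most_common": counts.most_common(1)[0][0] if counts else None,
        })
    return profile


# Hypothetical sample data, standing in for an extract of a source table.
SAMPLE = """customer_id,country,signup_date
1,US,2007-01-02
2,,2007-01-05
3,US,2007-01-03
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
sheet = profile_columns(rows)
for record in sheet:
    print(record)
```

Each record in `sheet` could become one line of the profiling document, with extra columns added by hand for business rules and observations.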
Search found 35 matches
- Fri Oct 19, 2007 1:39 am
- Forum: Information Analyzer (formerly ProfileStage)
- Topic: Information Analyzer Template
- Replies: 2
- Views: 3308
- Mon Jan 29, 2007 2:06 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Data Model
- Replies: 2
- Views: 1549
Data Model
Hi All,
I wanted to know the best way to create the actual data model in a database.
I have an Erwin model (data mart). Should I just use the forward engineer option and run the generated script (unchanged) in the back-end database?
Thanks.
- Fri Jan 26, 2007 12:06 am
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Dimension View
- Replies: 8
- Views: 2829
- Thu Jan 25, 2007 11:59 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Dimension View
- Replies: 8
- Views: 2829
Ray, I've got your point. I look up the dimension twice with two different dates (coming from the source), and the fact can have two DW (calendar) keys for both dates in the same record. I was just wondering about the idea of using the Calendar DW key as a DATE datatype. Would that help? Thanks, Santosh. Or two lookups, a...
- Thu Jan 25, 2007 5:25 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Dimension View
- Replies: 8
- Views: 2829
Hi Ray, I have two DATE fields going into the same FACT table. Instead of storing the dates as they are in the FACT, I want to populate the CALENDAR key. This way I can have all the information (which a CALENDAR dimension provides) for both dates. It should be a physical join in the database to access ...
- Thu Jan 25, 2007 1:43 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Fact to Fact
- Replies: 8
- Views: 3078
Fact to Fact
Hi Again,
I have a mindset where I am tempted to join two facts.
A fact to fact join.
Am I committing suicide?
Thanks.
- Thu Jan 25, 2007 1:39 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Dimension View
- Replies: 8
- Views: 2829
Dimension View
Hi All, Can we create a DATABASE VIEW of a dimension table (in my case the CALENDAR dimension) and then use this view as a lookup in the fact load? I have multiple date fields going into the fact, and I need to associate these dates with the Calendar dimension to get the related information. Is it...
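To make the idea concrete, here is a small sketch of one view per date role over a single calendar dimension, using SQLite through Python so it runs anywhere. The table, view, and column names (`calendar_dim`, `order_date_dim`, `ship_date_dim`, `sales_fact`) are invented for illustration and are not from the original job; the same pattern should carry over to other databases, though the syntax may differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE calendar_dim (
    calendar_key INTEGER PRIMARY KEY,
    calendar_date TEXT,
    month_name TEXT
);
INSERT INTO calendar_dim VALUES (1, '2007-01-25', 'January'),
                                (2, '2007-02-10', 'February');

-- One view per role the calendar plays in the fact table.
CREATE VIEW order_date_dim AS
    SELECT calendar_key AS order_date_key,
           calendar_date AS order_date,
           month_name AS order_month
    FROM calendar_dim;
CREATE VIEW ship_date_dim AS
    SELECT calendar_key AS ship_date_key,
           calendar_date AS ship_date,
           month_name AS ship_month
    FROM calendar_dim;

-- The fact row carries one calendar key per date field.
CREATE TABLE sales_fact (order_date_key INTEGER, ship_date_key INTEGER, amount REAL);
INSERT INTO sales_fact VALUES (1, 2, 100.0);
""")

rows = conn.execute("""
SELECT o.order_month, s.ship_month, f.amount
FROM sales_fact f
JOIN order_date_dim o ON o.order_date_key = f.order_date_key
JOIN ship_date_dim s ON s.ship_date_key = f.ship_date_key
""").fetchall()
print(rows)  # [('January', 'February', 100.0)]
```

The calendar data is stored once, and each view only renames the columns so that reports can tell the two dates apart.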
- Tue Jan 02, 2007 3:06 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Joining Dimensions in a data warehouse - is it possible?
- Replies: 3
- Views: 1772
Joining Dimensions in a data warehouse - is it possible?
Hi All, I have a situation where I need to look into the possibility of joining two dimensions in a data mart that we are designing. I am exploring joining the two dimensions using a joiner table (I am doing this because I need to show a many-to-many relationship between these dimensions). i.e. ...
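A joiner (bridge) table of the kind described could look roughly like the sketch below. It uses SQLite through Python only so that it is runnable; the `customer_dim`, `account_dim`, and `customer_account_bridge` names are hypothetical, since the post does not say which two dimensions are involved.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customer_dim (customer_key INTEGER PRIMARY KEY, customer_name TEXT);
CREATE TABLE account_dim (account_key INTEGER PRIMARY KEY, account_name TEXT);

-- Bridge (joiner) table: one row per customer/account pairing.
CREATE TABLE customer_account_bridge (
    customer_key INTEGER REFERENCES customer_dim(customer_key),
    account_key INTEGER REFERENCES account_dim(account_key),
    PRIMARY KEY (customer_key, account_key)
);

INSERT INTO customer_dim VALUES (1, 'Ann'), (2, 'Bob');
INSERT INTO account_dim VALUES (10, 'Joint'), (20, 'Savings');
INSERT INTO customer_account_bridge VALUES (1, 10), (2, 10), (1, 20);
""")

pairs = conn.execute("""
SELECT c.customer_name, a.account_name
FROM customer_account_bridge b
JOIN customer_dim c ON c.customer_key = b.customer_key
JOIN account_dim a ON a.account_key = b.account_key
ORDER BY c.customer_name, a.account_name
""").fetchall()
print(pairs)  # [('Ann', 'Joint'), ('Ann', 'Savings'), ('Bob', 'Joint')]
```

One thing to watch with this design: joining a fact to a dimension through the bridge can multiply fact rows, so measures may be double counted unless the bridge carries a weighting factor or the query is written with that in mind.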
- Wed Dec 06, 2006 11:00 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: COMPLEX FLAT FILE STAGE IN GRID
- Replies: 5
- Views: 2238
- Wed Dec 06, 2006 4:21 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: COMPLEX FLAT FILE STAGE IN GRID
- Replies: 5
- Views: 2238
COMPLEX FLAT FILE STAGE IN GRID
Hi,
I am reading an EBCDIC file and loading it into a dataset. I tested the job on an SMP system and it takes one minute, but when I run the same job on a DS 7.5.1A GRID box it takes 10 minutes.
How can I improve the performance? It is a fixed-width file.
Any help is appreciated.
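As a reference point when checking what the job is actually doing, here is a rough standalone sketch of reading a fixed-width EBCDIC file outside DataStage. It is not what the Complex Flat File stage does internally. The layout, field names, and the `cp037` code page are assumptions (the post does not state them), and it only handles character fields, not packed decimal.

```python
# Hypothetical fixed-width layout: (field name, width in bytes).
LAYOUT = [("account", 6), ("name", 10), ("state", 2)]
RECORD_LENGTH = sum(width for _, width in LAYOUT)


def parse_records(data, encoding="cp037"):
    """Split raw bytes into fixed-length records and decode each field."""
    records = []
    # Step through the buffer one full record at a time; a trailing
    # partial record is ignored.
    for start in range(0, len(data) - RECORD_LENGTH + 1, RECORD_LENGTH):
        raw = data[start:start + RECORD_LENGTH]
        record, offset = {}, 0
        for name, width in LAYOUT:
            record[name] = raw[offset:offset + width].decode(encoding).rstrip()
            offset += width
        records.append(record)
    return records


# Build sample EBCDIC bytes in memory instead of reading a real file.
sample_text = ("000123" + "SMITH".ljust(10) + "NY"
               + "000456" + "JONES".ljust(10) + "CA")
sample = sample_text.encode("cp037")

parsed = parse_records(sample)
print(parsed)
```

Timing a plain read like this against the job on both systems may help separate file access cost from the cost added by the grid configuration.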
- Tue Dec 05, 2006 9:22 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Pivot perfomance is bad
- Replies: 6
- Views: 2337
- Tue Dec 05, 2006 4:20 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Pivot perfomance is bad
- Replies: 6
- Views: 2337
- Tue Dec 05, 2006 1:28 pm
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Pivot perfomance is bad
- Replies: 6
- Views: 2337
- Tue Dec 05, 2006 11:50 am
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: Pivot perfomance is bad
- Replies: 6
- Views: 2337
Pivot perfomance is bad
Hi, I am running a job in DS 7.5.1A and it takes 3 hours for 0.6 million records. Design: Oracle source ---> Xfm ---> Pivot ---> DataSet. From the Oracle stage to the Xfm it takes only 3 minutes; the Pivot stage is the overhead. How can I improve the performance? I am pivoting 16 columns to rows. Th...
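For clarity on what "pivoting 16 columns to rows" means here, the sketch below shows the transformation in plain Python: each input record becomes 16 output rows that repeat the key. It only illustrates the logic and is not the Pivot stage or a tuning fix. The column names `col_1` to `col_16` and the `id` key are hypothetical.

```python
def pivot_columns_to_rows(record, key_fields, pivot_fields):
    """Yield one output row per pivoted column, repeating the key fields."""
    keys = {name: record[name] for name in key_fields}
    for position, name in enumerate(pivot_fields, start=1):
        yield {**keys, "pivot_index": position, "pivot_value": record[name]}


# Hypothetical input: one key column plus 16 columns named col_1 .. col_16.
PIVOT_FIELDS = [f"col_{i}" for i in range(1, 17)]
source_row = {"id": 42}
for i, name in enumerate(PIVOT_FIELDS, start=1):
    source_row[name] = i * 10

output_rows = list(pivot_columns_to_rows(source_row, ["id"], PIVOT_FIELDS))
print(len(output_rows))  # 16
print(output_rows[0])    # {'id': 42, 'pivot_index': 1, 'pivot_value': 10}
```

With 0.6 million input records this produces 9.6 million output rows, which is worth keeping in mind when judging the run time of the downstream stages.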
- Mon Dec 04, 2006 7:17 am
- Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- Topic: DataStage GRID
- Replies: 6
- Views: 4386
Why don't you use the same configuration file that created the dataset for the second job? The issue might not be exactly because you are using a different config file. It might be because, for the second job, you are trying to use a config file that does not include the nodes that were used to cr...