Search found 520 matches
- Wed Jun 07, 2006 2:53 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Extract Requirements of Data
- Replies: 4
- Views: 1481
I recommend that you get some training/mentoring in ETL or research or read some books. This would help you far more than the quick fix you will get here! If you learn it and then apply it you will own it, but if we give it to you then you will have learned nothing. There is a plethora of material o...
- Mon Jun 05, 2006 1:39 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Slow sort and hash access after migration to JFS2 from JFS
- Replies: 2
- Views: 861
- Mon Jun 05, 2006 12:47 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Slow sort and hash access after migration to JFS2 from JFS
- Replies: 2
- Views: 861
Slow sort and hash access after migration to JFS2 from JFS
Does anyone know why this might be happening? From what I understand the sorts were "reasonable" prior to the migration to JFS2, but now they are much slower.
Any ideas would be greatly appreciated.
Thanks!
- Fri Jan 06, 2006 8:26 am
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Help with using sed, awk, nawk or tr
- Replies: 22
- Views: 6018
D, Your presumption is correct and your solution worked wonderfully! - Thanks :-) The following rows of data - aa|[1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]2005|[1,4]11|[1,5]AF|[1,6]EPIR|[1,7]New Business|[1,8]10|[1,9]10|[1,10]1000|[1,11]980| aa|[1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]20...
- Thu Jan 05, 2006 7:24 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Performance Issue - identity column??
- Replies: 3
- Views: 1974
The main issue here is that you cannot (easily) retrieve the value of an identity column in a job processing updates and inserts without either building a hash lookup of what's already there or doing as you are - using a relational lookup. If your scenario is such that rows will not be duplicated wi...
- Thu Jan 05, 2006 5:37 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Help with using sed, awk, nawk or tr
- Replies: 22
- Views: 6018
- Thu Jan 05, 2006 4:52 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Help with using sed, awk, nawk or tr
- Replies: 22
- Views: 6018
- Thu Jan 05, 2006 4:42 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Help with using sed, awk, nawk or tr
- Replies: 22
- Views: 6018
- Thu Jan 05, 2006 3:20 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Help with using sed, awk, nawk or tr
- Replies: 22
- Views: 6018
Help with using sed, awk, nawk or tr
Here's the situation..... I have an input file which contains rows of data that look something like the following - [1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]2005|[1,4]11|[1,5]AF|[1,6]EPIR|[1,7]New Business|[1,8]10|[1,9]10|[1,10]1000|[1,11]980 Each field is separated by a "|" delimite...
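The excerpt above is cut off before the actual requirement is stated, so the following is only a minimal sketch of one way to pick such a row apart, not the solution from the thread. It assumes each field is a bracketed `[row,column]` index followed by a value, and that fields are separated by "|". The names `parse_row` and `FIELD_PATTERN` are made up for illustration.

```python
import re

# Assumed field layout: "[<row>,<column>]<value>", e.g. "[1,2]PI Svc Timeliness".
FIELD_PATTERN = re.compile(r"^\[(\d+),(\d+)\](.*)$")


def parse_row(line, delimiter="|"):
    """Split one input row into (row_index, column_index, value) tuples.

    Fields without a bracketed prefix are kept with None indexes, and
    empty fields (for example from a trailing delimiter) are skipped.
    """
    parsed = []
    for field in line.rstrip("\n").split(delimiter):
        if not field:
            continue
        match = FIELD_PATTERN.match(field)
        if match is None:
            parsed.append((None, None, field))
        else:
            parsed.append((int(match.group(1)), int(match.group(2)), match.group(3)))
    return parsed


# Sample row copied from the post above.
sample = (
    "[1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]2005|[1,4]11|[1,5]AF|"
    "[1,6]EPIR|[1,7]New Business|[1,8]10|[1,9]10|[1,10]1000|[1,11]980"
)

# Keep only the values, dropping the bracketed indexes.
values = [value for _, _, value in parse_row(sample)]
```

Keeping the indexes as integers rather than discarding them right away makes it possible to reorder or validate columns later, if that turns out to be what the job needs.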
- Thu Aug 11, 2005 12:33 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: SCD type at column level
- Replies: 5
- Views: 1683
I think going the hybrid route is a bad idea and I'll give you an example. Let's say you have a table named DIM_STORE. You decide that some columns in the table are type 1 and some are type 2. To avoid anomalies, each time any of the type 1 attributes change you must go back and update all rows base...
- Wed Jul 27, 2005 12:50 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: CRC32
- Replies: 5
- Views: 1504
The crc32 algorithm returns 4 bytes. The longer your input data string the more likely it will be that you will get an encoding collision. Of course it returns 4 bytes or 32 bits. I don't claim to fully understand the mathematics of CRC, but I do believe that string length does not really affect th...
- Thu Jul 21, 2005 3:41 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: CRC32
- Replies: 8
- Views: 2668
- Wed Jul 20, 2005 5:06 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: CRC32
- Replies: 8
- Views: 2668
It's actually 2^32 or 1 in 4,294,967,296 and that is for every row. It does not mean that an incorrect CRC will be generated if you process 4294967296 rows of data, rather each row has a 1 in 4294967296 chance. Not likely that this will fail for you. Starbucks has been using this for 3+ years and to...
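To make the per-row figure concrete, here is a rough sketch of CRC32-based change detection using Python's standard `zlib.crc32`, not the DataStage routine discussed in the thread. The helper names (`row_crc32`, `row_changed`) and the "|" delimiter are assumptions for illustration. The 1 in 4,294,967,296 figure also assumes checksums are spread evenly over all 32-bit values.

```python
import zlib

CRC_BITS = 32
CRC_SPACE = 2 ** CRC_BITS  # 4,294,967,296 possible checksum values


def row_crc32(fields, delimiter="|"):
    """Return the unsigned 32-bit CRC of a row's fields joined by a delimiter.

    The result is always 4 bytes wide, whatever the length of the input.
    """
    data = delimiter.join(fields).encode("utf-8")
    return zlib.crc32(data) & 0xFFFFFFFF


def row_changed(old_fields, new_fields):
    """Report a change when the two rows produce different checksums."""
    return row_crc32(old_fields) != row_crc32(new_fields)


# Chance that one changed row goes unnoticed because its new checksum happens
# to equal the old one (assumes uniformly distributed checksums).
MISS_PROBABILITY_PER_ROW = 1 / CRC_SPACE
```

The probability applies to each old-versus-new comparison on its own; it is not a counter that runs out after 4,294,967,296 rows.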
- Tue Jun 07, 2005 3:41 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Unable to see instance of MultipleInstance Job in Director
- Replies: 8
- Views: 2649
- Mon Jun 06, 2005 2:35 pm
- Forum: IBM® Infosphere DataStage Server Edition
- Topic: Multi Instance Job
- Replies: 7
- Views: 2175