Search found 520 matches

by mhester
Wed Jun 07, 2006 2:53 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Extract Requirements of Data
Replies: 4
Views: 1481

I recommend that you get some training or mentoring in ETL, do some research, or read some books. This would help you far more than the quick fix you will get here! If you learn it and then apply it, you will own it, but if we give it to you, you will have learned nothing. There is a plethora of material o...
by mhester
Mon Jun 05, 2006 1:39 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Slow sort and hash access after migration to JFS2 from JFS
Replies: 2
Views: 861

Craig,

AIX 5.1

Thanks for pointing that out.
by mhester
Mon Jun 05, 2006 12:47 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Slow sort and hash access after migration to JFS2 from JFS
Replies: 2
Views: 861

Slow sort and hash access after migration to JFS2 from JFS

Does anyone know why this might be happening? From what I understand the sorts were "reasonable" prior to the migration to JFS2, but now they are much slower.

Any ideas would be greatly appreciated.

Thanks!
by mhester
Fri Jan 06, 2006 8:26 am
Forum: IBM® Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 6018

D, Your presumption is correct and your solution worked wonderfully! - Thanks :-) The following rows of data - aa|[1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]2005|[1,4]11|[1,5]AF|[1,6]EPIR|[1,7]New Business|[1,8]10|[1,9]10|[1,10]1000|[1,11]980| aa|[1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]20...
by mhester
Thu Jan 05, 2006 7:24 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Performance Issue - identity column??
Replies: 3
Views: 1974

The main issue here is that you cannot (easily) retrieve the value of an identity column in a job processing updates and inserts without either building a hash lookup of what's already there or doing as you are - using a relational lookup. If your scenario is such that rows will not be duplicated wi...
by mhester
Thu Jan 05, 2006 5:37 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 6018

I implemented Arnd's solution (thanks Arnd!) and it works wonderfully. I just wanted to broaden my knowledge and do it in a way that I am not so familiar with.
by mhester
Thu Jan 05, 2006 4:52 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 6018

I really thought this question would have elicited a response from the Duke-a-nator!

Come on Kim..... give me the Unix one-line command answer :-)
by mhester
Thu Jan 05, 2006 4:42 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 6018

Ken and Arnd,

Thanks!

Both are solid solutions. I had hoped to do it via one of the commands I listed but I do understand that this may not be possible with a simple command.

Thanks again
by mhester
Thu Jan 05, 2006 3:20 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 6018

Help with using sed, awk, nawk or tr

Here's the situation..... I have an input file which contains rows of data that look something like the following - [1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]2005|[1,4]11|[1,5]AF|[1,6]EPIR|[1,7]New Business|[1,8]10|[1,9]10|[1,10]1000|[1,11]980 Each field is separated by a "|" delimite...
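
The post is cut off before the desired output is described, so the following is only a rough sketch of how the row layout shown above could be read, not the solution from the thread. It assumes each field is a "[row,col]" index followed by a value, with "|" between fields; the function name parse_row is made up for illustration.

```python
import re

# Assumed field layout: "[row,col]value", e.g. "[1,2]PI Svc Timeliness".
FIELD_RE = re.compile(r"^\[(\d+),(\d+)\](.*)$")


def parse_row(line):
    """Split one "|"-delimited row into (row, col, value) tuples."""
    cells = []
    for field in line.rstrip("\n").split("|"):
        if not field:
            # Skip empty pieces, e.g. the one left by a trailing "|".
            continue
        match = FIELD_RE.match(field)
        if match is None:
            raise ValueError("unexpected field: %r" % field)
        cells.append((int(match.group(1)), int(match.group(2)), match.group(3)))
    return cells


sample = "[1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]2005"
for cell in parse_row(sample):
    print(cell)
```

Whatever the real requirement was, having the index and the value as separate items makes it straightforward to drop, reorder, or rewrite the bracketed prefixes.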
by mhester
Thu Aug 11, 2005 12:33 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: SCD type at column level
Replies: 5
Views: 1683

I think going the hybrid route is a bad idea and I'll give you an example. Let's say you have a table named DIM_STORE. You decide that some columns in the table are type 1 and some are type 2. To avoid anomalies, each time any of the type 1 attributes change you must go back and update all rows base...
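
To make the cost of the hybrid approach concrete, here is a small sketch of what a type 1 change could look like in such a table. The post is truncated, so this assumes the update is keyed on the store's business key; the column names (store_key, manager, region) and the function apply_type1_change are invented for the example.

```python
def apply_type1_change(dim_rows, store_key, column, new_value):
    """Overwrite a type 1 attribute on every version row of one store.

    Returns the number of rows touched.
    """
    updated = 0
    for row in dim_rows:
        if row["store_key"] == store_key:
            row[column] = new_value
            updated += 1
    return updated


# Hypothetical DIM_STORE contents: "region" is tracked as type 2, so store
# S1 already has two version rows; "manager" is treated as type 1.
dim_store = [
    {"sk": 1, "store_key": "S1", "manager": "Ann", "region": "East", "current": False},
    {"sk": 2, "store_key": "S1", "manager": "Ann", "region": "West", "current": True},
    {"sk": 3, "store_key": "S2", "manager": "Bob", "region": "East", "current": True},
]

# One type 1 change has to be applied to both S1 rows, not just the current one.
touched = apply_type1_change(dim_store, "S1", "manager", "Cy")
print(touched)
```

The more type 2 history a store accumulates, the more rows each type 1 change has to rewrite.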
by mhester
Wed Jul 27, 2005 12:50 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: CRC32
Replies: 5
Views: 1504

The crc32 algorithm returns 4 bytes. The longer your input data string the more likely it will be that you will get an encoding collision. Of course it returns 4 bytes or 32 bits. I don't claim to fully understand the mathematics of CRC, but I do believe that string length does not really affect th...
by mhester
Thu Jul 21, 2005 3:41 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: CRC32
Replies: 8
Views: 2668

Craig,

I might just do that - thanks :shock:
by mhester
Wed Jul 20, 2005 5:06 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: CRC32
Replies: 8
Views: 2668

It's actually 1 in 2^32, or 1 in 4,294,967,296, and that is for every row. It does not mean that an incorrect CRC will be generated if you process 4,294,967,296 rows of data; rather, each row has a 1 in 4,294,967,296 chance. Not likely that this will fail for you. Starbucks has been using this for 3+ years and to...
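
The arithmetic can be checked with a few lines of Python. This is only a sketch: it uses zlib.crc32 from the Python standard library, which may not match the CRC32 routine used in DataStage, it joins fields with "|" purely for illustration, and it treats CRC values as if they were uniformly distributed, which is an idealisation.

```python
import zlib

CRC_SPACE = 2 ** 32  # number of distinct 32-bit CRC values


def row_crc(fields):
    """CRC32 of a row; joining the fields with "|" is an assumption."""
    return zlib.crc32("|".join(fields).encode("utf-8")) & 0xFFFFFFFF


def per_row_false_match_probability():
    """Idealised chance that a changed row keeps the same CRC32."""
    return 1 / CRC_SPACE


def expected_false_matches(changed_rows):
    """Expected number of changed rows that would go undetected."""
    return changed_rows / CRC_SPACE


print(CRC_SPACE)
print(per_row_false_match_probability())
print(expected_false_matches(10_000_000))
print(row_crc(["3009", "502", "PI Svc Timeliness"]))
```

The point of expected_false_matches is that the risk scales with the number of changed rows compared, and under these assumptions it stays well below one missed change even for ten million changed rows.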
by mhester
Tue Jun 07, 2005 3:41 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Unable to see instance of MultipleInstance Job in Director
Replies: 8
Views: 2649

Just a thought..... but I wonder if rebuilding the repository indexes might make this problem go away?
by mhester
Mon Jun 06, 2005 2:35 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Multi Instance Job
Replies: 7
Views: 2175

If you have indeed checked the "Allow Multiple Instance" box in the job properties and have recompiled the job, then you should see a drop-down titled "Invocation Id" on the same tab as the parameters when you run the job. At this point you can assign an invocat...