DSXchange

mhester

I recommend that you get some training/mentoring in ETL or research or read some books. This would help you far more than the quick fix you will get here! If you learn it and then apply it you will own it, but if we give it to you then you will have learned nothing. There is a plethora of material o...

mhester

Craig,

AIX 5.1

Thanks for pointing that out.

mhester

Does anyone know why this might be happening? From what I understand the sorts were "reasonable" prior to the migration to JFS2, but now they are much slower.

Any ideas would be greatly appreciated.

Thanks!

mhester

D, Your presumption is correct and your solution worked wonderfully! - Thanks :-) The following rows of data - aa|[1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]2005|[1,4]11|[1,5]AF|[1,6]EPIR|[1,7]New Business|[1,8]10|[1,9]10|[1,10]1000|[1,11]980| aa|[1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]20...

mhester

The main issue here is that you cannot (easily) retrieve the value of an identity column in a job processing updates and inserts without either building a hash lookup of what's already there or doing as you are - using a relational lookup. If your scenario is such that rows will not be duplicated wi...

mhester

I implemented Arnd's solution (thanks Arnd!) and it works wonderfully. I just wanted to broaden my knowledge and do it in a way that I am not so familiar with.

mhester

I really thought this question would have solicited a response from the Duke-a-nator!

Come on Kim..... give me the Unix one-line command answer

mhester

Ken and Arnd,

Thanks!

Both are solid solutions. I had hoped to do it via one of the commands I listed but I do understand that this may not be possible with a simple command.

Thanks again

mhester

Here's the situation..... I have an input file which contains rows of data that look something like the following - [1,0]3009|[1,1]502|[1,2]PI Svc Timeliness|[1,3]2005|[1,4]11|[1,5]AF|[1,6]EPIR|[1,7]New Business|[1,8]10|[1,9]10|[1,10]1000|[1,11]980 Each field is separated by a "|" delimite...

mhester

I think going the hybrid route is a bad idea and I'll give you an example. Let's say you have a table named DIM_STORE. You decide that some columns in the table are type 1 and some are type 2. To avoid anomolies, each time any of the type 1 attributes change you must go back and update all rows base...

mhester

The crc32 algorithm returns 4 bytes. The longer your input data string the more likely it will be that you will get an encoding collision. Of course it returns 4 bytes or 32 bits. I don't claim to fully understand the mathematics of CRC, but I do believe that string length does not really affect th...

mhester

Craig,

I might just do that - thanks

mhester

It's actually 2^32 or 1 in 4,294,967,296 and that is for every row. It does not mean that an incorrect CRC will be generated if you process 4294967296 rows of data, rather each row has a 1 in 4294967296 chance. Not likely that this will fail for you. Starbucks has been using this for 3+ years and to...

mhester

Just a thought..... but I wonder if rebuilding the repository indexes might make this problem go away?

mhester

If you have indeed checked the box on the job properties named "Allow Multiple Instance" and have recompiled the job then you should see a drop down with a title called "Invocation Id" on the same tab as the parameters when you run the job. At this point you can assign an invocat...

DSXchange

Search found 520 matches

Slow sort and hash access after migration to JFS2 from JFS

Help with using sed, awk, nawk or tr