Performance Test Problems with 64 bit Hash files on 8.7

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
fridge
Premium Member
Premium Member
Posts: 136
Joined: Sat Jan 10, 2004 8:51 am

Performance Test Problems with 64 bit Hash files on 8.7

Post by fridge »

Am currently running some performance testing in preperation for a migration from 8.1 aix to 8.7 linux.

I am running a very simple test Server job to populate a 64 bit hash file with several million rows. The same job with the same data and hash files created with same settings perform very differently on the two DS versions. Whereas the 8.1 aix job completes in 15 mins the 8.7 version runs into hours.

Although the jobs sit in different test environments, both have very little running on them (and the jobs have been ran at various times with the same results) and I am led to believe should be similar in set up.
Jobs which read/write from flat file to flat file, to dev/null and to a 32 bit hash file (volumes just under two gig) on both 8.1 and 8.7 all perform at similar speeds.
The 64 bit hash file was created with the mkdbfile command.
The DS tunable settings appear to match.

As anyone experienced any such performance issues with 64 bit hash files on 8.7 or can anyone offer advice as to what to check that might specifically impact on jobs using 64 bit hash files?

Cheers.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Sorry - No advice or explanation for the issues.

If you compare the resultant files at 8.1 and 8.7 are they identically sized? Those would be the DATA.30 and OVER.30 files.
fridge
Premium Member
Premium Member
Posts: 136
Joined: Sat Jan 10, 2004 8:51 am

Post by fridge »

Hi Arnd - long time no speak (its Rich Q in the UK). How goes the sailing?

Yeh when they were first created the files did match.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

I mean not after the mkdbfile but immediately after they were filled with data. If the source data is the same then both files should have exactly the same sizes immediately after filling.

If they are not identical that would point towards different configurations which, in turn, might explain the load time differences.

If the are identical then it might be due to physical things such as the target file system and underlying disk.

Rich - I'm not sailing yet, but am in the finishing phases of a project after which I'll relax in the sun on the boat again! How's Yorkshire?
fridge
Premium Member
Premium Member
Posts: 136
Joined: Sat Jan 10, 2004 8:51 am

Post by fridge »

Never got far enough to see the full results because of the time it was taking on 8.7 - I will run a test with a smaller volume.
fridge
Premium Member
Premium Member
Posts: 136
Joined: Sat Jan 10, 2004 8:51 am

Post by fridge »

Arnd,

I have cut the volumes down from 15 mill to 3 mill and the data and overflow files are exactly the same on both 8.7 and 8.1 test runs.

The job run time is twice as long (ran these jobs a few times) on 8.7. Its about 8,000 rps on 8.7 and 14,000 on 8.1. The startup time on the 8.1 job seems to be significamtly longer than on 8.7. Also the rps on the 8.1 job is fairly constant but on the 8.7 job it start out a bit lower than on the 8.1 job but then gradually falls away.

Any ideas?
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

No, I'm stymied as well for an explanation. You've stated that normal hashed file write speeds are similar and I'll assume that the files are on the same disk drive as the 64-bit ones so that should rule out different disk partition types/ I/O speeds being the culprit

Something is acting as a bottleneck here. Can you run the jobs and compare CPU loads on the two environments? If the job doesn't do much in the way of computation the CPU load shouldn't approach a high load but are the loads similar on both environments? Likewise can you monitor the I/O activity to the drive with the hashed files to see what the Mb/Sec loads are and if they are maxed out as well as seeing if the newer, slower 64Bit file is (for some reason) generating more I/O despite being identical in size to the original?
mbreion
Premium Member
Premium Member
Posts: 6
Joined: Mon Jun 21, 2010 2:28 am

have you verified the environment are equal ?

Post by mbreion »

Did you check the UVCONFIG file for both ?

This reminds me of the UVSYNC and SYNCALOC variables. Changing from AIX to linux may have changed the default for these and thus dropping the performance when volume goes up.


Sorry for the delay, didn't see the post was old.
Maybe it helps anyway.
Post Reply