klheureux said:
The new system shows 8 processors in the task
manager. I had affinity set to all 8 while running just Pro-E and
Mechanica.
The run on my old system took 10.87 hours to run, on the new system
3.54 hours
wskunz said:
On our existing Pro/E systems, Dell Precision 370,
3.4 GHz P4, 2GB RAM, nVidia 3400, XP Pro 32bit, WF2 M050. A test
analysis took 2.5 hours.
The same hardware running XP64 Pro & WF2 took 45 minutes.
On a new Dell Precision 670, 2-3.6 GHz Xeons, 6 GB RAM, nvidia 3400,
XP64 Pro & WF2 took 20 minutes.
Very interesting! I don't quite know what to make of these numbers, it
seems very strange indeed that merely the transition to 64-bit systems
would make such a big difference. I know that the x86-64 ISA allows for
more data registers and tricks like that, but usually it only accounts for a
speedup in the magnitude of 10-20% - they
SAY. I myself have
noticed a 25-40% speedup in a benchmark I ran on two systems (one 32-
bit Linux and the other a 64-bit) but that test wasn't very scientific.
So while 64-bitness can account for a lot, I fail to see how it could speed
up computations to less than
a third, unless the rig was starved for
RAM in the first setting? Boggles my mind!
klheureux: It is a widely accepted and rather sad fact that Intels
Hyperthreading is decelerating performance rather than accelerating it in
most cases, so I suggest that you disable it in the BIOS and see if you gain
another 10% or so - despite the fact that it will only look like 4 CPUs in
the Task Manager. Might make a difference to the positive, might not.
I did a quick check of the rigs the other guys use around here (while I
have a single-CPU P4 they have dual Xeons) and it seems as if Pro/
Mechanica does indeed utilize multi-CPU capabilities, but it doesn't do it
very well. The combined CPU utilization was wavering at some 50-80%.
Ideally, they should be smacked up at 100% all the time.
I'm underway of talking the management here into letting me build a
quad Opteron, so if it works out then I might be able to post a nice
performance comparison myself. =] I'll even try a few ANSYS runs to see if
it utilizes multithreading more efficient than Pro/M does. Since we are
pressed for time and have ANSYS in house, it might actually be rational to
use it in place of Pro/M.
Thanks for your inputs!