CalculiX with Intel's MPI cluster sparse solver

I think it looks correct. I get the same output files, with some differences. I think most are due to differences in the eigensolver and natural frequencies. See the discussion on that topic here:

What kind of Google Cloud machines are you going to run on? To see a speedup you will need to use two dedicated machines with all the available CPUs on each.

1 Like

Thank you very much for the explanation.

As for the Google Cloud machines, I have yet to decide what exactly I need. So far I just wanted to try your install procedure, so I set up an instance group where the number of instances could change from 1 to 10. The instances were E2 machines and the boot disk was 50 GB (oneAPI Base required at least 23 GB). At the moment execution time is not the priority, but I need a lot of memory. My desktop at home has only 64 GB of memory, and that is not enough for my problem.
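For anyone wanting to reproduce a similar setup from the command line, the gcloud commands would look roughly like this (a sketch only, not the exact commands I ran; the names, machine type, zone, and image are placeholders):

```
# Rough sketch of a command-line equivalent; names, machine type, zone
# and image are placeholders, not the exact values used.
gcloud compute instance-templates create ccx-template \
    --machine-type=e2-standard-8 \
    --boot-disk-size=50GB \
    --image-family=debian-12 --image-project=debian-cloud

gcloud compute instance-groups managed create ccx-group \
    --template=ccx-template --size=1 --zone=us-central1-a

# Scale the group between 1 and 10 instances as needed.
gcloud compute instance-groups managed resize ccx-group \
    --size=10 --zone=us-central1-a
```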

Any plans to port it to v2.20?

Only works with v2.18. Will update it for newer versions if there is interest.

No plans yet unless someone has a compelling reason :slight_smile: Were you able to run this on a cluster and see any speedup?

1 Like

So I have finally run a buckling job on Google Cloud: a single custom N2 instance with 8 vCPUs, 120 GB RAM, and a 96 GB boot disk. At least 95 GB of RAM was used, and it took about 100 minutes.
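For reference, creating a single custom instance like that from the command line would look roughly as follows (a sketch only; the instance name, zone, and image are placeholders, and the exact custom machine-type string should be checked against the Compute Engine docs):

```
# Rough sketch: custom N2 instance with 8 vCPUs and 120 GB RAM (122880 MB).
# The -ext suffix allows more than the standard memory-per-vCPU ratio;
# verify the machine-type string against the Compute Engine docs.
gcloud compute instances create ccx-buckling \
    --machine-type=n2-custom-8-122880-ext \
    --boot-disk-size=96GB \
    --image-family=debian-12 --image-project=debian-cloud \
    --zone=us-central1-a
```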

Thank you very much for the script and your help!

1 Like

That is good. But if you are just running on one instance, then there is probably no need to run the MPI version. The regular executable with the Pardiso solver should work just as well.

1 Like

Does it not matter that there are multiple processors?

Both the MPI and non-MPI versions can use all the processors. The only difference is that the MPI version can also use multiple hosts. So if you install option 2 below, you are limited to the processors on one host.

(1) Spooles (not recommended; 2-3X slower than Pardiso and cannot solve models with more than a million degrees of freedom)
(2) Pardiso (must have the Intel compiler; if you do not have it, it is available for free from Intel.com and does not require administrative privileges to install)
(3) Pardiso MPI (same requirements as above, but also needs the HPC kit; only works with v2.18, and I will update it for newer versions if there is interest)
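In practice the difference looks roughly like this (a sketch only; the executable names follow the naming used elsewhere in this thread, and the exact MPI launch line depends on the install script and your Intel MPI setup):

```
# Option 2 (Pardiso, shared memory): one host, all of its cores via OpenMP.
export OMP_NUM_THREADS=8
./ccx_2.18_MT -i jobname          # reads jobname.inp

# Option 3 (Pardiso MPI, cluster sparse solver): typically one MPI rank per
# host with OpenMP threads inside each. Intel MPI launcher shown; hostnames
# are placeholders and the exact arguments may differ.
export OMP_NUM_THREADS=8
mpirun -np 2 -ppn 1 -hosts node1,node2 ./ccx_2.18_MPI -i jobname
```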

1 Like

I might note that I have run problems requiring more memory than I have by increasing my available page file size and putting the page file on an SSD. I have a very fast SSD, but watching the I/O bandwidth indicates that is probably not necessary: most FEM problems only access modest portions of memory at one time, or at nearly the same time. On my 64 GB machine, working virtual memory set sizes up to about 160% of physical memory have been usable; larger than that, heavy paging slows things down a lot. PaStiX does not use memory as efficiently as Pardiso, so problems do not work as well once they grow larger than physical memory, and they reach that limit sooner. I have not been using 2.18 yet.
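On Linux the rough equivalent is adding a swap file on the SSD; a minimal sketch (the path is hypothetical and the size would be tuned to the model):

```
# Minimal sketch: add a 64 GB swap file on a fast SSD (path is hypothetical).
sudo fallocate -l 64G /mnt/nvme/swapfile
sudo chmod 600 /mnt/nvme/swapfile
sudo mkswap /mnt/nvme/swapfile
sudo swapon /mnt/nvme/swapfile
swapon --show     # confirm the additional swap is active
```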

1 Like

Just bumping this in case anyone wants to test it. There hasn't been much interest, so I am guessing nobody runs CalculiX on a cluster.

The speedup shown in the first post is decent: nearly 50% faster when using 4 machines instead of 1.

1 Like

Have you tried to get ccx 2.22 to work?

Do you need 2.22 with the MPI capabilities on a cluster or just single multi-core machine?

Both would be good.
I would ship the single multicore machine version with the Cubit CalculiX component and could use the cluster version myself.

I updated the script so 2.22 should now install for a single multicore machine. But the MPI version will still only work with 2.18. It will take some work to update that to 2.22. First I would need to see the use case, i.e. what features do you need in 2.22 that are not in 2.18? And what speedup are you getting on your cluster?

I will try the updated script over the weekend.

I don’t have a usable cluster currently, just a bunch of old workstations. I am currently making the budget for next year and would like to invest a bit in hardware.

As for the use case: I recently tried to set up a workflow with OpenRadioss and Cubit and want to rebuild that example in CalculiX. The Johnson-Cook material is only available from 2.21 onward.
The more important thing is that I need a bit more power for running FSI problems using OpenFOAM, CalculiX and preCICE.

2 Likes

Thank you for sharing!

ccx_2.22_MT and ccx_2.18_MPI work smoothly on my workstation with an Intel Core i7-10875 processor.

1 Like

@feacluster have you got ccx_2.22_MT with the Intel libraries already linked?

Just like your ccx_2.19_MT.

1 Like

Which Linux are you running?

Currently Ubuntu 22.04, but I will switch to 24.04 soon.