CalculiX with AOCL solver

xyont · September 20, 2024, 11:33am

hi, recently i’m looking of matrix library solver for AMD cpu since experiences MKL seems not optimized. Several tests from external references (i.e Mathworks) have shown faster using AMD Optimizing CPU Libraries (AOCL) than Intel MKL (Pardiso) about 20% speed up. Hopefully, this solver will be available in the future version of CalculiX, best.

jbr · September 22, 2024, 4:52pm

@xyont, are these just optimized blas, lapack, libm, etc, for AMD? Do they have an actual sparse solver like pardiso? Remember that the pardiso solver that ships with MKL is an older version of the solver. The newer version is faster compared to the MKL versions - Panua Technologies

xyont · September 22, 2024, 5:35pm

seems it has own sparse solver when looking from documentation and project. The link of Pardiso from Panua is an improvement of original versions, the library is proprietary commercial for both end user and developer, but Intel MKL available free for user. Maybe this is the reason to not become popular as Intel does, even shown and reported faster than MKL. In contrast, AOCL is opensource project by AMD, it has been used by many also, e.g Matlab, Ansys, LS Dyna, MSC Nastran and Comsol,

c3d10 · December 3, 2024, 12:01am

Did you ever find any info on using AOCL with calculix? I was wondering the same thing.

feacluster · December 4, 2024, 6:36pm

I could look into it, but not sure what kind of speedup to expect. If just 10% or less then not sure it is worth the effort. And not sure how many people use AMD cpus?

jbr · December 5, 2024, 2:05am

Yeah… I think we have to find a better mechanism to include new solvers so that we could test different options for appropriate models. I would love to test petsc, trilinos, mumps, superlu, suitesparse/umfpack, but I haven’t had the chance to work on this.

xyont · December 5, 2024, 6:02am

as comparison, some reported it has performed better in Ansys (link)

Ansys Mechanical with AMD optimized AOCL BLIS deliver a geomean speedup of 1.26x with gains as much as 2.12x

rsmith · December 5, 2024, 6:26am

If you do find time to work on that, please choose solvers that “match” CalculiX itself;

free software,
runs on all platforms that CalculiX runs on,
[added 20241206] written in C or Fortran for easy integration.

If the source code for the solver is not available, finding bugs becomes a lot harder. And I suspect very few people will choose to spend thousands of euros on a solver.

Furthermore, linking CalculiX with a proprietary library violates the license that CalculiX is distributed under.

Added 20241206:

Solvers written in Fortran or C are probably easiest to integrate.
Ideally, one should profile the existing and new solvers before investing much time in integration!

jbr · December 5, 2024, 4:22pm

That’s exactly what I had in mind and those are fair points for advancing the use of ccx.

100% agree with you there.

I’ll need to do a deep dive into the different licenses since each of these solvers operates under a unique license. I’m leaving the links here for future reference to review them thoroughly. These might also be useful for anyone else exploring the same topic:

NorbertH · December 5, 2024, 9:27pm

Don’t forget MFEM. That’s a great library and already got the solvers implemented.

https://mfem.org/features/#built-in-solvers

jbr · December 6, 2024, 12:48am

They use some of the ones I already mentioned, but I forgot about hypre, certainly worth exploring:

hypre License

c3d10 · December 10, 2024, 1:45am

I use AMD CPU’s (looking to upgrade to Ryzen 9 soon) but I’m sure they’re not all that common for HPC-type applications

c3d10 · December 10, 2024, 1:46am

Agreed 110% on the profiling of solvers. I feel this is a common thing though (how many people need sparse equation solvers and how many sparse solvers are out there - quite a few), are there standard benchmarks that you’ve seen available for these options?

feacluster · December 11, 2024, 7:30pm

I did some reading on AOCC sparse solver. Seems it is just an interative solver, not direct? I don’t believe many users are using iterative solvers with Calculix. That is common in CFD where all element are the same solid type. See:

AOCL-Sparse

If people are using iterative solvers then there are probably better ones to explore first before AOCC. Something with MPI capability specifically.

c3d10 · December 11, 2024, 8:19pm

Interesting, why not use iterative solvers?

In my experience (commercial mechanical FEA tools for linear/nonlinear statics and modal dynamics), direct solvers are better for smaller problems and iterative solvers are more important for larger ones.

feacluster · December 11, 2024, 8:22pm

Iterative solvers only work if your entire mesh is all solids. If there are any beams, rigids, shells etc. then they don’t converge.

c3d10 · December 11, 2024, 9:41pm

Ah, interesting! I did some reading and this page helped clear up my confusion:
https://classes.engineering.wustl.edu/2009/spring/mase5513/abaqus/docs/v6.6/books/usb/default.htm?startat=pt03ch06s01aus39.html

Thanks!

xyont · December 22, 2024, 5:06am

AOCL is already implemented by many, maybe Ansys (Mechanical) user can confirm this limitation if true.

rsmith · December 22, 2024, 12:21pm

My feeling is that “standard benchmark” is somewhat of an oxymoron.

For benchmarking I tend to use whichever problems are taxing my current hardware.

Up until recently the iterative solver and SPOOLES were the only ones available on FreeBSD. Since the problems that I tend to work on are often beam-like, the iterative solver is a poor fit. SPOOLES works well enough even though I sometimes have to make the mesh coarse enough to fit into RAM. Not that big of an issue when using regular meshes of C3D20 or C3D10. Now I’ve built CalculiX with PaStiX, I have found it to be often but not always faster than SPOOLES. I haven’t done a proper comparison of memory use betewwn PaStiX and SPOOLES.

xyont · December 23, 2024, 12:37am

mostly FE code benchmarked for element formulation and type of analysis, only few doing for the solver performance. Previously i read some report from TNO Diana about MKL Pardiso, CalculiX about Spooles or iterative and later is PaStiX, also from Ansys (Mechanical) about AOCL.

Topic		Replies	Views
Calculix BEGINING	41	1564	August 15, 2023
CalculiX with Intel's MPI cluster sparse solver	38	2623	December 20, 2024
CalculiX speed up (interesting websites)	9	1479	August 13, 2021
CalculiX and PARDISO (Windows)	7	532	January 23, 2024
Inquiry About Solvers Linked in CalculiX Linux Executable	2	161	June 12, 2024

CalculiX with AOCL solver

Related topics