Fix LAPACK non-orthogonality #842

antoine-levitt · 2023-04-03T12:30:28Z

Will not break! Will not break! It broke. (https://www.youtube.com/watch?v=qioOho4_ar0)

This fixes the sanity check failing for Float32 that @mfherbst has been seeing.

mfherbst · 2023-04-03T12:45:20Z

Nice. Thanks for fixing @antoine-levitt !

I wonder if we should avoid the re-orthogonalisation for Float64. A rough check with my recent timings suggests this can increase the cost of the RR by 10% in some cases (where the RR takes a few % of the total runtime).

antoine-levitt · 2023-04-03T12:58:28Z

If you've got the timings ready, can you check the impact of replacing F = eigen(Hermitian(XAX)) by F = Eigen(LAPACK.syev!('V', 'U', XAX)...)? This avoids the loss of orthogonality, but the solver is supposed to be slower.

antoine-levitt · 2023-04-03T13:04:24Z

I'd really like to keep doing the "right thing" by default as far as possible. I also have (moderate) hope that this fix will reduce the annoying "hey, LOBPCG has decided we need 100 iterations to converge this now". If it's only a couple of percents of a couple of percents, let's not bother...

We can also have a "degraded" faster setting that skips some orthogonalizations, but I'm not very comfortable with the tolerance fiddling...

jaemolihm · 2023-04-03T14:04:19Z

There's also zheevd which do not have an interface in LAPACK.jl. When I tested, it was the fastest for matrix size N > 100. I haven't tested the orthogonality, but it should be okay according to https://netlib.org/lapack/lawnspdf/lawn183.pdf

QR [SYEV] and DC [SYEVD] are the most accurate algorithms, measured both in terms of producing pairwise orthogonal eigenvectors and small residuals norms. MR [SYEVR] is less accurate but still achieves errors of size O(nε), where n is the dimension and ε is machine epsilon

mfherbst · 2023-04-03T14:33:15Z

Thanks for the input, both. I'll look into the different solvers wrt. timings and stability a bit when I get the time.

Regarding the current reorthogonalisation: My tests suggest it's indeed not something we need to worry (we're talking 100ms for calculations that take a few minutes).

antoine-levitt · 2023-04-04T13:34:20Z

OK, so I tested all three : syev, syevr, syevd (thanks @jaemolihm for sending me the syevd wrapper!). syevd is actually the most orthogonal, always keeping the vectors orthogonal to 1e-5, and often much less than that. Syevr is atrocious, very often producing 1e-4, and sometimes 1e-3! Syev is in the middle, staying within 1e-5 but hitting it more often than syevd. On my test case, without additional orthogonalizations, syevr made the assert crash; syev made the LOBPCG report non-convergence; syevd was completely fine.

The rayleigh-ritz step is negligible for small systems and only becomes significant for a large number of bands, where according to @jaemolihm syevd is the fastest. @jaemolihm said he's going to make a PR to julia with the syevd wrapper. So let's keep the ortho! in there, and when the PR to julia is in, use it when VERSION is adequate.

In my tests a few years back (almost 10 years now!), ELPA (https://elpa.mpcdf.mpg.de/) was the fastest direct eigensolver around; I wonder if that's still true. No idea about accuracy though.

mfherbst · 2023-04-04T14:34:01Z

So let's keep the ortho! in there, and when the PR to julia is in, use it when VERSION is adequate.

Sounds good. @jaemolihm feel free to keep us posted :).

~~syevd is divide-and-conquer while syev is QR, right?~~ (just saw the answer above ...)

By the way the test failure could be an issue in Julia's generic linear algebra library, but I have not investigated any further.

Fix LAPACK non-orthogonality

c139d43

jaemolihm mentioned this pull request Apr 5, 2023

Add syevd LAPACK Hermitian eigensolver JuliaLang/julia#49262

Merged

mfherbst added 5 commits April 6, 2023 23:23

small

c24d04a

Dispatch and random fixes

ae43633

Merge branch 'master' into lobpcg_fix

a2614f4

Fix up GPU a bit

11af950

up

1ec57ed

mfherbst merged commit 6a8c25c into master Apr 14, 2023

mfherbst deleted the lobpcg_fix branch April 14, 2023 11:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix LAPACK non-orthogonality #842

Fix LAPACK non-orthogonality #842

antoine-levitt commented Apr 3, 2023

mfherbst commented Apr 3, 2023

antoine-levitt commented Apr 3, 2023

antoine-levitt commented Apr 3, 2023

jaemolihm commented Apr 3, 2023

mfherbst commented Apr 3, 2023

antoine-levitt commented Apr 4, 2023

mfherbst commented Apr 4, 2023 •

edited

Loading

Fix LAPACK non-orthogonality #842

Fix LAPACK non-orthogonality #842

Conversation

antoine-levitt commented Apr 3, 2023

mfherbst commented Apr 3, 2023

antoine-levitt commented Apr 3, 2023

antoine-levitt commented Apr 3, 2023

jaemolihm commented Apr 3, 2023

mfherbst commented Apr 3, 2023

antoine-levitt commented Apr 4, 2023

mfherbst commented Apr 4, 2023 • edited Loading

mfherbst commented Apr 4, 2023 •

edited

Loading