EOF tests are failing in CI for Windows #463

kafitzgerald · 2023-09-12T14:49:43Z

Describe the bug
Several of the EOF tests are failing in CI for Windows. You can see this in PR #460 where Windows was added to the testing matrix.

Specifically, there appears to be a sign error in the resulting arrays for the EOF functions.

Expected behavior
Tests should pass.

OS:
Windows

Additional notes:
I only looked into this briefly and didn't see anything offhand.

It does look like the eofs package we depend upon supports Windows.

We probably need to sort this out before merging PR #460.

cyschneck · 2023-11-07T17:46:02Z

EOFS Documentation

eofs works on Python 2 or 3 on Linux, Windows or OSX

kafitzgerald · 2023-11-07T17:50:18Z

Here's a direct link to where the GeoCAT-comp eofs tests are failing for Windows in CI: https://github.com/NCAR/geocat-comp/actions/runs/6131275834/job/16642482498?pr=460

cyschneck · 2023-11-07T20:13:49Z

Windows Test Failures: 10 tests

FAILED test/test_stats.py::Test_eof::test_eof_00
FAILED test/test_stats.py::Test_eof::test_eof_deprecated
FAILED test/test_stats.py::Test_eof::test_eof_01
FAILED test/test_stats.py::Test_eof::test_eof_02
FAILED test/test_stats.py::Test_eof::test_eof_14
FAILED test/test_stats.py::Test_eof::test_eof_15
FAILED test/test_stats.py::Test_eof::test_eof_16
FAILED test/test_stats.py::Test_eof::test_eof_n_01
FAILED test/test_stats.py::Test_eof::test_eof_n_03
FAILED test/test_stats.py::Test_eof::test_eof_n_03_1

AssertionError: Arrays are not almost equal to 5 decimals shared among all test failures where the arrays are equal but with sign inverted

E       AssertionError:
E       Arrays are not almost equal to 5 decimals
E
E       Mismatched elements: 16 / 16 (100%)
E       Max absolute difference: 0.5
E       Max relative difference: 2.
E        x: array([[[0.25, 0.25, 0.25, 0.25],
E               [0.25, 0.25, 0.25, 0.25],
E               [0.25, 0.25, 0.25, 0.25],
E               [0.25, 0.25, 0.25, 0.25]]])
E        y: array([[[-0.25, -0.25, -0.25, -0.25],
E               [-0.25, -0.25, -0.25, -0.25],
E               [-0.25, -0.25, -0.25, -0.25],
E               [-0.25, -0.25, -0.25, -0.25]]])

cyschneck · 2023-11-08T19:29:13Z

stats.py: 244

eofs = solver.eofs(neofs=neofs, eofscaling=eofscaling)

solver defined stats.py:232

data, solver = _generate_eofs_solver(data,
                                         time_dim=time_dim,
                                         weights=weights,
                                         center=center,
                                         ddof=ddof)

_generate_eofs_solver defined stats.py:72

cyschneck · 2023-11-13T19:43:18Z

Potential source of issue: numpy (v. 1.23.5)

Running np.linalg.svd on Windows:

Python 3.11.6 | packaged by conda-forge | (main, Oct  3 2023, 10:29:11) [MSC v.1935 64 bit (AMD64)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> a = np.array([[2, 2], [1, 1]])
>>> u, s, Vh = np.linalg.svd(a, full_matrices=False)
>>> u
array([[-0.89442719, -0.4472136 ],
       [-0.4472136 ,  0.89442719]])
>>> s
array([3.16227766e+00, 1.10062118e-17])
>>> Vh
array([[-0.70710678, -0.70710678],
       [ 0.70710678, -0.70710678]])

Running np.linalg.svd on Linux:

Python 3.11.6 | packaged by conda-forge | (main, Oct  3 2023, 10:40:35) [GCC 12.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> a = np.array([[2, 2], [1,1]])
>>> u, s, Vh = np.linalg.svd(a, full_matrices=False)
>>> u
array([[-0.89442719, -0.4472136 ],
       [-0.4472136 ,  0.89442719]])
>>> s
array([3.16227766, 0.        ])
>>> Vh
array([[-0.70710678, -0.70710678],
       [-0.70710678,  0.70710678]])

Across platforms, the output of S (Vectors with singular vectors) is different. On Windows S=array([3.16227766e+00, 1.10062118e-17]) and on Linux S=array([3.16227766, 0. ])

philipc2 · 2023-11-13T21:49:47Z

Potential source of issue: numpy (v. 1.23.5)

Running np.linalg.svd on Windows:

Python 3.11.6 | packaged by conda-forge | (main, Oct  3 2023, 10:29:11) [MSC v.1935 64 bit (AMD64)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> a = np.array([[2, 2], [1, 1]])
>>> u, s, Vh = np.linalg.svd(a, full_matrices=False)
>>> u
array([[-0.89442719, -0.4472136 ],
       [-0.4472136 ,  0.89442719]])
>>> s
array([3.16227766e+00, 1.10062118e-17])
>>> Vh
array([[-0.70710678, -0.70710678],
       [ 0.70710678, -0.70710678]])

Running np.linalg.svd on Linux:

Python 3.11.6 | packaged by conda-forge | (main, Oct  3 2023, 10:40:35) [GCC 12.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> a = np.array([[2, 2], [1,1]])
>>> u, s, Vh = np.linalg.svd(a, full_matrices=False)
>>> u
array([[-0.89442719, -0.4472136 ],
       [-0.4472136 ,  0.89442719]])
>>> s
array([3.16227766, 0.        ])
>>> Vh
array([[-0.70710678, -0.70710678],
       [-0.70710678,  0.70710678]])

Across platforms, the output of S (Vectors with singular vectors) is different. On Windows S=array([3.16227766e+00, 1.10062118e-17]) and on Linux S=array([3.16227766, 0. ])

For the resulting S, are you sure the numbers are different? It looks like on Windows its printing all the significant figures, while on Linux it truncates it when printing.

cyschneck · 2023-11-13T22:12:04Z

Using the same data array on both Linux and Windows: [[2, 2], [1,1]]

Windows:

Python 3.11.6 | packaged by conda-forge | (main, Oct  3 2023, 10:29:11) [MSC v.1935 64 bit (AMD64)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> d = np.array([[2, 2], [1, 1]])
>>> u, s, vH = np.linalg.svd(d, full_matrices=False)
>>> vH
array([[-0.70710678, -0.70710678],
       [ 0.70710678, -0.70710678]])

Linux:

Python 3.11.6 | packaged by conda-forge | (main, Oct  3 2023, 10:40:35) [GCC 12.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> d = np.array([[2, 2], [1, 1]])
>>> u, s, vH = np.linalg.svd(d, full_matrices=False)
>>> vH
array([[-0.70710678, -0.70710678],
       [-0.70710678,  0.70710678]])

The second values are flipped in Linux from Windows from [ 0.70710678, -0.70710678] to [-0.70710678, 0.70710678]

cyschneck · 2023-11-14T19:27:51Z

Seems like this is a result of when having duplicate singular values the SVD is not unique

When you have duplicate singular values, as you do here, the SVD is not unique. The vectors associated with the duplicate singular values can be rotated freely. Different versions of the underlying linear algebra library may take different paths and return different choices in such cases. Both versions of the returned matrices are correct.

This can be also seen when using scipy.linalg.svd that gives the same output as np.linalg.svd

Windows:

Python 3.11.6 | packaged by conda-forge | (main, Oct  3 2023, 10:29:11) [MSC v.1935 64 bit (AMD64)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> import scipy
>>> d = np.array([[2, 2], [1, 1]])
>>> u, s, vH = scipy.linalg.svd(d, full_matrices=False)
>>> vH
array([[-0.70710678, -0.70710678],
       [ 0.70710678, -0.70710678]])

Linux:

Python 3.11.6 | packaged by conda-forge | (main, Oct  3 2023, 10:40:35) [GCC 12.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> import scipy
>>> d = np.array([[2, 2], [1, 1]])
>>> u, s, vH = scipy.linalg.svd(d, full_matrices=False)
>>> vH
array([[-0.70710678, -0.70710678],
       [-0.70710678,  0.70710678]])

The magnitude of values is correct, but the signs can differ

In order to check that the SVD is correct you need to check that the matrices u and v are indeed unitary and that x = u @ np.diag(s) @ vh. If both conditions hold, than the SVD is correct, otherwise it isn't.

SVD of a non-square matrix is not unique in U and V. Even if you have a square matrix with non-zero, non-degenerate singular values, singular vectors in U and V are only unique up to a sign factor

cyschneck · 2023-11-15T19:39:17Z

From the NCL Graphics: EOFS

Note on signs of EOF analysis (contributed by Andrew Dawson, U. East Anglia)

EOFs are eigenvectors of the covariance matrix formed from the input data. Since an eigenvector can be multiplied by any scalar and still remain an eigenvector, the sign is arbitrary. In a mathematical sense the sign of an eigenvector is rather unimportant. This is why the EOF analysis may yield different signed EOFs for slightly different inputs. Sign only becomes an issue when you wish to interpret the physical meaning (if any) of an eigenvector.

You should approach the interpretation of EOFs by looking at both the EOF pattern and the associated time series together. For example, consider an EOF of sea surface temperature. If your EOF has a positive centre and the associated time series is increasing, then you will interpret this centre as a warming signal. If your EOF had come out the other sign (ie. a negative centre), then the associated time series would also be the opposite sign and you would still interpret the centre as a warming signal.

In essence, the sign flip does not change the physical interpretation of the result. Hence, it is up to you to choose which sign to associate with your EOF patterns for visualisation (remembering that any sign change to an EOF must be applied to the associated time series also). Usually you would simply adjust the sign so that all your EOF patterns with the same physical interpretation also look the same

It is possible that the slightly different way that Windows rounds and handles numbers on the back end is resulting in slightly different inputs that are yielding different signed EOFs. But it appears that the sign of the array might be irrelevant to the stats.py output based on how EOFs work so the tests might currently be too strict

kafitzgerald added the bug Something isn't working label Sep 12, 2023

kafitzgerald added this to the Continuous Integration milestone Sep 12, 2023

anissa111 modified the milestones: Upstream Testing Improvement, Windows Compatibility Sep 13, 2023

anissa111 assigned cyschneck and anissa111 Nov 2, 2023

This was referenced Nov 15, 2023

unable to install geocat in windows #131

Closed

EOF Unsigned Vector Tests #516

Merged

anissa111 closed this as completed in #516 Nov 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EOF tests are failing in CI for Windows #463

EOF tests are failing in CI for Windows #463

kafitzgerald commented Sep 12, 2023

cyschneck commented Nov 7, 2023

kafitzgerald commented Nov 7, 2023

cyschneck commented Nov 7, 2023

cyschneck commented Nov 8, 2023

cyschneck commented Nov 13, 2023 •

edited

Loading

philipc2 commented Nov 13, 2023

cyschneck commented Nov 13, 2023 •

edited

Loading

cyschneck commented Nov 14, 2023 •

edited

Loading

cyschneck commented Nov 15, 2023 •

edited

Loading

EOF tests are failing in CI for Windows #463

EOF tests are failing in CI for Windows #463

Comments

kafitzgerald commented Sep 12, 2023

cyschneck commented Nov 7, 2023

kafitzgerald commented Nov 7, 2023

cyschneck commented Nov 7, 2023

cyschneck commented Nov 8, 2023

cyschneck commented Nov 13, 2023 • edited Loading

philipc2 commented Nov 13, 2023

cyschneck commented Nov 13, 2023 • edited Loading

cyschneck commented Nov 14, 2023 • edited Loading

cyschneck commented Nov 15, 2023 • edited Loading

cyschneck commented Nov 13, 2023 •

edited

Loading

cyschneck commented Nov 13, 2023 •

edited

Loading

cyschneck commented Nov 14, 2023 •

edited

Loading

cyschneck commented Nov 15, 2023 •

edited

Loading