Conversion from mpz to numpy longdouble is incorrect #507

pearu · 2024-08-25T18:01:40Z

As in the title.

Reproducer (using gmpy2 version 2.1.2):

>>> import gmpy2, numpy
>>> i = 5579686107214117131790972086716881
>>> numpy.longdouble(gmpy2.mpz(i))  # wrong result
5.5796861072141166625e+33
>>> numpy.longdouble(i)             # expected result
5.579686107214117132e+33

Notice that

>>> numpy.longdouble(numpy.double(i))
5.5796861072141166625e+33

That is, the mpz->longdouble conversion appears to go through mpz->double->longdouble, whereas it should go through mpz->int->longdouble.
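
A minimal sketch of why the detour through double is lossy, using only the standard library and assuming that Python's float is an IEEE 754 double (true on common platforms). The integer needs far more significant bits than a double can hold, so its value has already changed before longdouble ever sees it.

```python
import sys

i = 5579686107214117131790972086716881

# Number of significant bits a Python float (C double) can hold.
double_bits = sys.float_info.mant_dig

# Round-tripping through float exposes the low-order bits that were lost.
via_double = int(float(i))
error = via_double - i

print(i.bit_length(), double_bits)  # bits needed vs. bits available
print(error != 0)                   # True: the double route is lossy
```

Any conversion that starts from the rounded double can at best reproduce `via_double`, which matches the `numpy.longdouble(numpy.double(i))` observation above.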


skirpichev · 2024-08-25T22:53:39Z

I don't think it's a gmpy2 issue.

The numpy.longdouble constructor just doesn't call the __int__() dunder at all:

>>> import gmpy2, numpy
>>> class Spam:
...     def __int__(self):
...         return 123
... 
>>> numpy.longdouble(Spam())
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: float() argument must be a string or a real number, not 'Spam'
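
The error message mentions float(), so the same behaviour can be sketched with the built-in alone: float() ignores __int__() and accepts only objects that define __float__() (or, since Python 3.8, __index__()). The class and helper names below are made up for illustration, and the sketch says nothing about NumPy internals beyond what the traceback shows.

```python
class OnlyInt:
    """Defines __int__ only; float() does not consult it."""

    def __int__(self):
        return 123


class WithFloat:
    """Defines __float__; float() uses it directly."""

    def __float__(self):
        return 123.0


def try_float(obj):
    """Return float(obj), or the name of the exception it raises."""
    try:
        return float(obj)
    except TypeError as exc:
        return type(exc).__name__


print(try_float(OnlyInt()))    # TypeError
print(try_float(WithFloat()))  # 123.0
```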

skirpichev · 2024-08-26T04:32:21Z

Ah, I see. This will work if the gmpy2 types have an __array__() dunder.

@pearu, is this The Right Way to coerce scalars like mpz to numpy datatypes?

pearu · 2024-08-26T07:04:15Z

@skirpichev, the __array__ protocol certainly provides a way to coerce mpz to longdouble:

import gmpy2
import numpy

class BigIntArray:

    def __init__(self, obj):
        self.obj = obj

    def __array__(self, dtype=None, copy=None):
        # Defaults let NumPy call this with or without dtype/copy.
        return numpy.array(int(self.obj), dtype=dtype)

i = 5579686107214117131790972086716881
    
x1 = numpy.longdouble(BigIntArray(i))
x2 = numpy.longdouble(BigIntArray(gmpy2.mpz(i)))
print(x1, x2)  # outputs 5.579686107214117132e+33 5.579686107214117132e+33

pearu · 2024-08-26T07:10:05Z

Notice that the __array__ protocol adds a numpy dependency to gmpy2, although this could be made a runtime-only dependency by using:

    def __array__(self, dtype=None, copy=None):
        import numpy
        return numpy.array(int(self.obj), dtype=dtype)

skirpichev · 2024-08-26T08:30:27Z

I think a hard dependency on numpy is a no-go.

A runtime dependency will kind of work, but... it will probably be too slow. This might be a good idea for something like fmpz_mat (python-flint). But gmpy2 has only scalar types, and I believe that explicit type casts are better.

pearu · 2024-08-26T09:01:35Z

A runtime dependency will kind of work, but... it will probably be too slow.

IMHO, this is acceptable, as the issue is about silent incorrectness. In fact, a repeated import costs about as much as a dictionary access anyway:

In [8]: import numpy

In [9]: %timeit import numpy
64 ns ± 0.114 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

In [10]: d = {"foo": 2}

In [11]: %timeit d['foo']
24.9 ns ± 0.124 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

while creating longdouble is more expensive than import:

In [24]: %timeit numpy.longdouble(1)
319 ns ± 1.39 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)
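
The comparison can also be reproduced outside IPython with the standard timeit module. This is only a sketch: it uses the stdlib json module as a stand-in for numpy (an assumption made so that it runs anywhere), and the absolute numbers will differ by machine.

```python
import sys
import timeit

import json  # the first import does the real work; later ones hit the cache

# Once imported, the module object is cached in sys.modules.
assert sys.modules["json"] is json

n = 100_000
t_import = timeit.timeit("import json", number=n)
t_lookup = timeit.timeit("d['foo']", setup="d = {'foo': 2}", number=n)

print(f"cached import: {t_import / n * 1e9:.0f} ns per loop")
print(f"dict lookup:   {t_lookup / n * 1e9:.0f} ns per loop")
```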

skirpichev · 2024-08-26T09:39:59Z

creating longdouble is more expensive than import

(Hmm, I would guess it's because npy_longdouble_from_PyLong() doesn't do it numerically.)

On the other hand, why doesn't numpy call the __int__() dunder if it's provided? FYI: the feature was implemented in numpy/numpy#9968.

pearu · 2024-08-26T09:44:27Z

On the other hand, why doesn't numpy call the __int__() dunder if it's provided?

It should not call __int__ when mpz provides __float__. And it would not call __float__ when __array__ is going to be provided.
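
A hedged sketch of that precedence, assuming NumPy is installed. Wrapped is a hypothetical stand-in for a big-integer type such as mpz: it offers both a lossy __float__ and an exact __array__, and the helper checks which result numpy.longdouble ends up with.

```python
class Wrapped:
    """Hypothetical stand-in for a big-integer type such as mpz."""

    def __init__(self, value):
        self.value = value

    def __float__(self):
        # Lossy route: rounds the value to a double.
        return float(self.value)

    def __array__(self, dtype=None, copy=None):
        # Exact route: hand NumPy a Python int. The import is local so
        # that NumPy stays a runtime-only dependency.
        import numpy
        return numpy.array(int(self.value), dtype=dtype)


def array_route_wins(i=5579686107214117131790972086716881):
    """True if longdouble(Wrapped(i)) matches the direct int conversion."""
    import numpy
    return bool(numpy.longdouble(Wrapped(i)) == numpy.longdouble(i))
```

Where longdouble is wider than double, array_route_wins() returning True indicates that __array__ was preferred over __float__. Where the two types coincide, the check cannot tell the routes apart.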

skirpichev · 2024-08-26T10:22:43Z

Sorry, I meant __index__(). This looks like python/cpython#120950.


skirpichev · 2024-09-13T10:42:14Z

The PR is ready for review: #514

skirpichev added a commit to skirpichev/gmpy that referenced this issue Sep 13, 2024

Add mpz.__array__() method to interact with numpy

46eb88d

Closes aleaxit#507

skirpichev mentioned this issue Sep 13, 2024

Add mpz.__array__() method to interact with numpy #514

Merged

skirpichev added a commit to skirpichev/gmpy that referenced this issue Sep 13, 2024

Add mpz.__array__() method to interact with numpy

a17c17e

Closes aleaxit#507

skirpichev added a commit to skirpichev/gmpy that referenced this issue Sep 13, 2024

Add mpz.__array__() method to interact with numpy

54fbfc8

Closes aleaxit#507

skirpichev added a commit to skirpichev/gmpy that referenced this issue Sep 13, 2024

Add mpz.__array__() method to interact with numpy

cf11349

Closes aleaxit#507

skirpichev added a commit to skirpichev/gmpy that referenced this issue Sep 13, 2024

Add mpz.__array__() method to interact with numpy

5099d35

Closes aleaxit#507

casevh closed this as completed in #514 Sep 14, 2024
