Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fixed Float16 from Float64 and BigFloat #42837

Merged
merged 7 commits into from
Nov 4, 2021

Conversation

oscardssmith
Copy link
Member

previously Float16(::Float64) and Float16(::BigFloat) had double rounding issues.

@oscardssmith oscardssmith added bugfix This change fixes an existing bug bignums BigInt and BigFloat float16 labels Oct 28, 2021
@KristofferC KristofferC added the needs tests Unit tests are required for this change label Oct 28, 2021
@oscardssmith
Copy link
Member Author

Testing these conversions are really hard because the way you generally test math is computing in higher precision and checking if the result is the same. I should probably add tests for a few likely problematic values like Float16(1.0+eps(Float16)/2 + eps(Float64)). Any other tests worth doing for this?

@vchuravy vchuravy added backport 1.6 Change should be backported to release-1.6 backport 1.7 labels Oct 28, 2021
@gbaraldi
Copy link
Member

gbaraldi commented Oct 29, 2021

Maybe one test for previous float, not sure if it would add coverage or just add clutter.

@oscardssmith
Copy link
Member Author

Tests added. This still has a bug for some values less than nextfloat(Float16(0)), but it is a major improvement.

base/mpfr.jl Outdated
function Float16(x::BigFloat) :: Float16
res = Float32(x)
resi = reinterpret(UInt32, res)
if (resi&0x7fffffff) < 0x38800000
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe this starts to need some comments now. It's a bit of magic number soup now.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good point. that's just if Float16(res) is subnormal.

Copy link
Member

@KristofferC KristofferC Oct 30, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And this is more efficient than issubnormal(Float16(res))? Does it necessarily have to be?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah. this is only 2 instructions while converting to float16 is relatively expensive.

@oscardssmith oscardssmith removed the needs tests Unit tests are required for this change label Oct 30, 2021
@oscardssmith
Copy link
Member Author

oscardssmith commented Nov 1, 2021

Should we merge this as is? This is already a major improvement over current status and I'm not 100% sure how to fix the last remaining case the edge between nextfloat(Float16(0)) and 0

base/mpfr.jl Show resolved Hide resolved
@vtjnash vtjnash added the merge me PR is reviewed. Merge when all tests are passing label Nov 1, 2021
@vtjnash
Copy link
Member

vtjnash commented Nov 1, 2021

Do you mean all of 0.0:floatmin(Float16) or just 0.0:nextfloat(Float16(0))

@oscardssmith
Copy link
Member Author

yeah. Fixed.

@oscardssmith
Copy link
Member Author

The only test error remaining appears to be openblas threading issue.

src/runtime_intrinsics.c Outdated Show resolved Hide resolved
@oscardssmith
Copy link
Member Author

last remaining bug fixed! from my perspective, this is ready to merge.

@KristofferC
Copy link
Member

How is it ensured that these tests both test the C version and the Julia version?

@oscardssmith
Copy link
Member Author

the c version is float64. the Julia version is bigfloat

Copy link
Member

@vtjnash vtjnash left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I helped co-author this, so I will approve here too.

@oscardssmith
Copy link
Member Author

Any objections to merging this?

@oscardssmith oscardssmith merged commit 707c57c into JuliaLang:master Nov 4, 2021
@oscardssmith oscardssmith deleted the fix-big-to-float16 branch November 4, 2021 14:22
@DilumAluthge DilumAluthge removed the merge me PR is reviewed. Merge when all tests are passing label Nov 7, 2021
@KristofferC
Copy link
Member

Needs a manual backport towards 1.7.

@oscardssmith
Copy link
Member Author

What branch should I target the PR to?

@KristofferC
Copy link
Member

backports-release-1.7.

@oscardssmith
Copy link
Member Author

backported: #43092 is the backport

@KristofferC KristofferC mentioned this pull request Jan 10, 2022
50 tasks
LilithHafner pushed a commit to LilithHafner/julia that referenced this pull request Feb 22, 2022
* fixed Float16 from Float64 and BigFloat. Many thanks to Jamison.
LilithHafner pushed a commit to LilithHafner/julia that referenced this pull request Mar 8, 2022
* fixed Float16 from Float64 and BigFloat. Many thanks to Jamison.
@KristofferC
Copy link
Member

Also needs a manual backport towards backports-release-1.6.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport 1.6 Change should be backported to release-1.6 bignums BigInt and BigFloat bugfix This change fixes an existing bug float16
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants