Add FloatPrinter based on Grisu3 #4333

will · 2017-04-24T04:30:45Z

This improves the speed of transforming floats to their string
representation. It is based on the 2010 paper "Printing Floating-Point
Numbers Quickly and Accurately with Integers" by Florian Loitsch[1].

Most of the code is a port from the BSD-licensed C++ project
"double-conversion"[2], which was extracted from the V8 engine.

The Grisu3 algorithm is fast because it deals only with fixed-size
integer arithmetic. It takes advantage of the extra bits left over from
the 53-bit significand in a 64-bit number to help find the optimal string
representation. However, this only works for 99.5% of floats, and it
rejects the remaining 0.5%. Rejected numbers still need to be printed
with some other, slower method.
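
To make the "extra bits" point concrete, here is a minimal sketch in Ruby (not code from this patch; `decompose` and the constants are hypothetical names). It splits a Float64 into an integer significand and a power-of-two exponent, which is the kind of representation Grisu-style algorithms work on. The significand never needs more than 53 bits, so a 64-bit word has 11 bits of headroom. Ruby integers are arbitrary precision, so this only illustrates the layout, not the fixed-size arithmetic itself.

```ruby
# Hypothetical helper, for illustration only: split a Float64 into
# [significand, exponent] such that value == significand * 2**exponent.
# Assumes a finite, non-negative input.
SIGNIFICAND_BITS = 52      # explicitly stored fraction bits
EXPONENT_BIAS    = 0x3FF

def decompose(f)
  bits = [f].pack("G").unpack1("Q>")             # raw IEEE 754 bits
  biased_exp = (bits >> SIGNIFICAND_BITS) & 0x7FF
  frac = bits & ((1 << SIGNIFICAND_BITS) - 1)
  if biased_exp.zero?
    # Denormal: no hidden bit, fixed minimum exponent.
    [frac, -EXPONENT_BIAS - SIGNIFICAND_BITS + 1]
  else
    # Normal: restore the hidden leading 1 bit.
    [frac | (1 << SIGNIFICAND_BITS), biased_exp - EXPONENT_BIAS - SIGNIFICAND_BITS]
  end
end

sig, exp = decompose(1.0)
puts "significand uses #{sig.bit_length} bits, #{64 - sig.bit_length} spare; exponent #{exp}"
```

The denormal branch uses the same exponent expression, -0x3FF - 52 + 1, that the diy_fp spec in this PR checks for.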

1: http://www.cs.tufts.edu/~nr/cs257/archive/florian-loitsch/printf.pdf
2: https://github.com/google/double-conversion

Previous issues #4308 #2220


will · 2017-04-24T04:32:28Z

Edit: this patch now has Float32 support; disregard the rest of this comment.

From #4308

I tried to cast to Float64 in the current algorithm but the float changes its value and the end result is different than the original. I don't understand why, but that's what happened. You could probably try it with grisu3 and see what happens. If it works, we should do it :-)

I ran into the same thing. I'm going to look at whether Float32 support can be added by using https://github.com/google/double-conversion/blob/9ed0dec708a18f0bbdf1a1edaee1b6b86be2043a/double-conversion/ieee.h#L263, but I'll do that in a separate PR if it works.
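
The value change described above is what widening does: a Float32 holds the nearest single-precision value, and converting it to Float64 preserves that value exactly, so a Float64 shortest-digits printer then shows digits the user never typed. A small sketch in Ruby, which has no Float32 type, so `to_f32` is a hypothetical helper that rounds through a 4-byte encoding:

```ruby
# Hypothetical helper: round a Float to single precision and widen it back.
# Ruby floats are always 64-bit, so this goes through a 4-byte encoding.
def to_f32(f)
  [f].pack("e").unpack1("e")
end

widened = to_f32(0.1)
puts widened          # prints 0.10000000149011612
puts widened == 0.1   # false: the nearest Float32 to 0.1 is not the nearest Float64
```

A printer that knows the input is a Float32 only needs enough digits to tell it apart from neighbouring Float32 values, which is why single-precision needs its own boundary computation rather than a cast.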

Sija · 2017-04-25T01:16:18Z

Almost there, there's just one failing formatter check:

Error: formatting 'src/float_printer.cr' produced changes

bcardiff · 2017-04-25T15:03:19Z

@will what was the problem with -NaN? AFAIK -NaN is a valid float.

I haven't reviewed the code; I am asking just because the commit message says "remove -NaN test".

will · 2017-04-25T15:16:03Z

@bcardiff it was working locally, but was failing on travis. I wasn't actually sure if -NaN was a valid float, and assumed it was a platform issue since it was failing on travis.

If -NaN is valid, I can try spinning up an instance to debug why it was printing as NaN on travis.

will · 2017-04-25T18:25:04Z

["nan", "-nan"].each {|s| a = s.to_f; puts [s, pointerof(a).as(UInt64*).value.to_s(16)]}

on Linux 4.10.10-200.fc25.x86_64 #1 SMP Thu Apr 13 01:11:51 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

["nan", "7ff8000000000000"]
["-nan", "7ff8000000000000"]

on Darwin 16.5.0 Darwin Kernel Version 16.5.0: Fri Mar 3 16:52:33 PST 2017; root:xnu-3789.51.2~3/RELEASE_X86_64 x86_64

["nan", "7ff8000000000000"]
["-nan", "fff8000000000000"]

It looks like crystal on linux is not setting the sign bit. I wonder though if this is a bug in String#to_f.

bcardiff · 2017-04-25T18:42:48Z

Crystal String#to_f relies on LibC.strtod. I read that the NaN treatment differs from platform to platform.

The only way I was able to build a -NaN on osx with crystal 0.22.0 was to parse a -NaN as you did:

fs = [0.0 / 0.0,
      -0.0 / 0.0,
      0.0 / -0.0,
      -0.0 / -0.0,
      "NaN".to_f64,
      "-NaN".to_f64]

fs.each do |f|
  {f, f.to_s, pointerof(f).as(UInt64*).value.to_s(16), f.nan?}
  # 5 times => nan.0  "nan.0"  "7ff8000000000000"  true
  # 1 time  => nan.0  "nan.0"  "fff8000000000000"  true
end

From https://linux.die.net/man/3/snprintf it seems that -NaN's are not defined in SUSv2.

I now notice that the test_str spec you removed was originally introduced by you. Maybe we are just fine with unsigned NaN.

will · 2017-04-25T18:50:57Z

I updated the test to use the memory representation of -nan and nan. This way, even if it's unlikely to come across a -NaN on linux, the tests should work cross platform.
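
The idea of building the test inputs from their memory representation, rather than parsing "nan" and "-nan", can be sketched as follows. This is Ruby, not the Crystal spec from the patch, and `float_from_bits` and `bits_of` are hypothetical helper names. The two bit patterns are the ones printed in the comments above.

```ruby
# Build doubles from explicit bit patterns instead of parsing "nan"/"-nan",
# so the input does not depend on the platform's strtod.
def float_from_bits(bits)
  [bits].pack("Q>").unpack1("G")
end

# Read back the raw IEEE 754 bits of a Float.
def bits_of(f)
  [f].pack("G").unpack1("Q>")
end

NAN_BITS     = 0x7ff8000000000000
NEG_NAN_BITS = 0xfff8000000000000

[NAN_BITS, NEG_NAN_BITS].each do |bits|
  f = float_from_bits(bits)
  puts "#{bits.to_s(16)} nan?=#{f.nan?} sign_bit=#{bits_of(f) >> 63}"
end
```

Both values report `nan?` as true; only the top bit differs, which is the bit a printer has to inspect if it wants to distinguish them.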

Sija · 2017-04-29T19:44:32Z

spec/std/float_printer/diy_fp_spec.cr


    fp.exp.should eq -0x3FF - 52 + 1
    # This is denormal, so no hidden bit
    fp.frac.should eq 1
  end

+  it "converst min f32" do


typo: converst -> converts

will · 2017-05-01T01:47:18Z

Just for fun, I wanted to bench this in a different way. This is just the most straightforward way in each language. I wouldn't be surprised if there were fancy ways to eke out performance here and there, but that really wasn't the point.

This is the time and average throughput for generating 1 GiB of floats, one per line, to stdout:

./crystal_old | pv -Ss 1g > /dev/null
   1GiB 0:01:39 [10.2MiB/s] [=================>] 100%
./crystal_new | pv -Ss 1g > /dev/null
   1GiB 0:00:51 [19.9MiB/s] [=================>] 100%
./c_float | pv -Ss 1g > /dev/null
   1GiB 0:00:39 [  26MiB/s] [=================>] 100%
ruby f.rb | pv -Ss 1g > /dev/null
   1GiB 0:01:39 [10.3MiB/s] [=================>] 100%
./go_float | pv -Ss 1g > /dev/null
   1GiB 0:01:15 [13.6MiB/s] [=================>] 100%
./rust_float | pv -Ss 1g > /dev/null
   1GiB 0:01:19 [12.8MiB/s] [=================>] 100%

# crystal, compiled with --release
a = 1.0
loop { puts a += 1.1 }

// C
// compiled with -O on LLVM version 8.1.0 (clang-802.0.38)
#include <stdio.h>

int main() {
  double a = 1.0;
  for (;;) {
    printf("%.17g\n", a);
    a = a + 1.1;
  }
}

# ruby v2.4.1p111
a = 1.0
loop { puts a += 1.1 }

// go 1.8.1
package main
import "fmt"
func main() {
  a := 1.0
  for {
    fmt.Println(a);
    a += 1.1;
  }
}

// rust 1.17.0, built with -O
fn main() {
  let mut a = 1.0;
  loop {
    println!("{}",a);
    a += 1.1;
  }
}

I wanted to also do node because grisu3 is used in V8, but I had problems with it:

a = 1.0
for (;;) {
  console.log(a);
  a += 1.1
}

node f.js | pv -Ss 1g > /dev/null
^C37MiB 0:02:11 [ 0 B/s] [=> ] 13% ETA 0:14:04

It was at 0 B/s when I killed it there, and it would drop from around 200 KB/s to 0 after a while. Something is obviously wrong here, but eh.

Fryguy · 2017-05-02T21:14:58Z

Minor, but would it be a better file organization to have the class be nested under Float instead of a new top-level class? (i.e. Float::Printer instead of FloatPrinter)

asterite · 2017-06-04T14:44:42Z

@will Thank you so much for this!!

After this I will do a few minor changes, like probably move FloatPrinter inside Float, and introduce an Object#bitcast method to avoid all those pointerof(var).as(T*).value (which can be used in a few other places too).

RX14 · 2017-06-04T14:53:24Z

@asterite can you explain the bitcast name? It doesn't seem to fit to me.

asterite · 2017-06-04T18:16:39Z

@RX14 LLVM uses bitcast. It casts an object to another by just using the same bits. C++ uses reinterpret_cast. But since we don't have other casts (well, as) I think that's a good name.

asterite · 2017-06-04T18:16:48Z

Any other suggestion, though?

RX14 · 2017-06-04T18:23:38Z

How about unsafe_as? It's basically as but works on a lot more types, because it's unsafe.

asterite · 2017-06-04T18:33:07Z

Hmmm... I think I like that name :-)

For example you can't do 1.as(Float32), so I was going to say that it's a bit strange that 1.unsafe_as(Float32) would work, but it makes sense because it's reinterpreting the value as another type, even though there's no "safe" way to make that work.
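
The distinction being discussed, a numeric conversion versus reading the same bits as another type, can be illustrated outside Crystal. The sketch below is Ruby and does not show how `unsafe_as` is implemented; `reinterpret_u32_as_f32` is a hypothetical helper that takes a 32-bit integer, matching the size of Crystal's default `1` (Int32) and of Float32.

```ruby
# Reinterpret the 32 bits of an integer as an IEEE 754 single-precision float.
# This is a bit-level cast, as opposed to a numeric conversion such as 1.to_f.
def reinterpret_u32_as_f32(n)
  [n].pack("L<").unpack1("e")
end

puts 1.to_f                               # numeric conversion: 1.0
puts reinterpret_u32_as_f32(1)            # same bits read as a float: a tiny denormal (2**-149)
puts reinterpret_u32_as_f32(0x3f800000)   # the bit pattern of 1.0 as a Float32: 1.0
```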

RX14 · 2017-06-04T18:47:08Z

@asterite yes, that was exactly what I was trying to get at.

will added 2 commits April 23, 2017 20:06

FloatPrinter: make docs and interface more consistent

c0dee22

will mentioned this pull request Apr 24, 2017

Faster Float to String #4308

Closed

will force-pushed the grisu3 branch from c0de410 to c0de4b0 Compare April 25, 2017 14:38

FloatPrinter: fix -NaN test, fix formatting

c0de300

will force-pushed the grisu3 branch from c0de4b0 to c0de300 Compare April 25, 2017 18:27

Sija reviewed Apr 29, 2017

View reviewed changes

will added 2 commits April 29, 2017 12:46

FloatPrinter: add Float32 support

c0deb1f

FloatPrinter: add license to specs

c0dee23

will force-pushed the grisu3 branch from c0def35 to c0dee23 Compare April 29, 2017 19:50

asterite merged commit 32f583f into crystal-lang:master Jun 4, 2017

asterite added kind:feature topic:stdlib labels Jun 4, 2017

mverzilli added this to the Next milestone Jun 4, 2017

asterite added a commit that referenced this pull request Jun 4, 2017

Rename FloatPrinter to Float::Printer. Related to #4333

4d07680

Sija referenced this pull request Jun 27, 2017

Crystal 0.23.0 Changelog

3c3d3e2
