proposal: cmd/compile: define register-based calling convention #18597

dr2chase · 2017-01-10T19:41:56Z

Performance would be generally somewhat improved if arguments-to and results-from function and method calls used registers instead of the stack. Projected improvements based on a limited prototype are in the range of 5-10%.

The running CL for the prototype: https://go-review.googlesource.com/c/28832/
The prototype, because it is a prototype, uses a pragma that should be unnecessary in the final design, though helpful during development.

(This is a placeholder bug for a design doc.)

problems/tradeoffs noted below, through 2017-01-19 (#18597 (comment))

This will reduce the usefulness of panic tracebacks because it will confuse/corrupt the per-frame argument information there. This was already a problem with SSA and register allocation and spilling, this will make it worse. Perhaps only results should be returned in registers.

Does this provide enough performance boost to justify "breaking" (how much?) existing assembly language?

Given that this breaks existing assembly language, why aren't we also doing a more thorough revision of the calling conventions to include callee-saves registers?

This should use the per-platform ABIs, to make interaction with native code go more smoothly.

Because this preserves the same memory layout of the existing calling, it will consume more stack space than it otherwise might, if that had been optimized out.

And the responses to above, roughly:

In compiler work, 5% (as a geometric mean across benchmarks) is a big deal.

Panic tracebacks are a problem; we will work on that. One possible mitigation is to use DWARF information to make tracebacks much better in general, including properly named and interpreted primitive values. A simpler mitigation for targeted debugging could be an annotation indicating that a function should be compiled to store arguments back to the stack (old style) to ensure that particular function's frame was adequately informative. This is also going to be an issue for improved inlining because regarded at a low level intermediate frames will disappear from backtraces.

The scope of required assembly-language changes is expected to be quite small; from Go-side function declarations the compiler ought to be able to create shims for the assembly to use around function entry, function exit, and surrounding assembly-language CALLs to named functions. The remaining case is assembly-language CALL via a pointer, and these are rare, especially outside the runtime. Thus, the need to bundle changes because they are disruptive is reduced, because the changes aren't that disruptive.

Incorporating callee-saves registers introduces a garbage-collector interaction that is not necessarily insurmountable, but other garbage-collected languages (e.g., Haskell) have been sufficiently intimidated by it that they elected not to use callee-saves registers. Because the assembler can also modify stack frames to include callee-save spill areas and introduce entry/exit shims to save/restore callee-save registers, this appears to have lower impact than initially estimated, and thus we have reduced need to bundle changes. In addition, if we stake out potential callee-save registers early developers can look ahead and adapt their code before it is required.

If each platform's ABI were used instead of this adaptation of the existing calling conventions, the assembly-language impact would be larger, the garbage-collector interactions would be larger, and either best-treatment for Go's multivalue returns would suffer or the cgo story would be sprinkled with asterisks and gotchas. As an alternative (a different way of obtaining a better story for cgo), would we consider a special annotation for cgo-related functions to indicate that exactly the platform calling conventions were used, with no compromises for Go performance?

gopherbot · 2017-01-10T21:55:42Z

CL https://golang.org/cl/35054 mentions this issue.

davecheney · 2017-01-11T02:14:03Z

5% for amd64 doesn't feel like it justifies the cost of breaking all the assembly written to date.

minux · 2017-01-11T02:25:44Z

Additionally, passing argument in registers will make displaying function arguments in traceback much harder, if not impossible. One reason I like Go so much is how Go's traceback makes debugging so much easier by providing enough contextual information about the frames. Frankly, I'd have expected much larger performance improvement for switching the calling convention to use registers. If the benefit is indeed only 5-10%, IMHO it's not worth the trouble. We should pursue better inlining and automatic vectorization instead. The Intel CPUs have invested so much circuit in optimizing the stack (operations), so I'm not too surprised though. The benefit should be larger on RISC architectures.

davecheney · 2017-01-11T02:41:35Z

I agree with Minux, optimising function calls for 10% speedup vs inlining and all the code generation benefits that unlocks doesn't feel like a good investment.

josharian · 2017-01-11T03:04:39Z

I'm still thinking about the proposal, but I will note that for asm functions with go prototypes and normal stack usage, we can do automated rewrites. Which is also a reminder that the scope for this proposal should include updating vet's asmdecl check.

minux · 2017-01-11T05:57:45Z

I wouldn't worry too much about assembly functions, as in the worse case, we can have an opt-in mechanisms for assembly functions and then develop a mode in cmd/asm to help people rewrite (or better yet, automate the translation in cmd/asm so that people won't even notice the difference. we can have a new textflag for new style assembly functions). However, I do care deeply about arguments in tracebacks. I tried patch set 30 of CL 28832 on this example: https://play.golang.org/p/UV-E4wyL2T And the result is: panic: 0 goroutine 1 [running]: panic(0x456a80, 0xc42000e118) $GOROOT/src/runtime/panic.go:531 +0x1cf main.F(0x50) /tmp/x.go:8 +0x6b main.F(0x0) /tmp/x.go:6 +0xa main.F(0x0) /tmp/x.go:6 +0xa main.F(0xc420052000) /tmp/x.go:6 +0xa main.F(0x0) /tmp/x.go:6 +0xa main.F(0x7) /tmp/x.go:6 +0xa main.F(0x7f6c34290000) /tmp/x.go:6 +0xa main.F(0x60) /tmp/x.go:6 +0xa main.F(0xc420070058) /tmp/x.go:6 +0xa main.F(0x0) /tmp/x.go:6 +0xa main.F(0xc4200001a0) /tmp/x.go:6 +0xa main.main() /tmp/x.go:13 +0x9 I'd like to hear if we have any plans for restoring the existing behavior of showing the first few arguments for each frame.

randall77 · 2017-01-11T07:06:10Z

@minux Yes, we will have all the information necessary to print args as we do now. I suspect it just isn't implemented yet.

minux · 2017-01-11T07:20:31Z

I'm curious to know how do you plan to implement it without incurring the overhead of storing the initial value into memory?

randall77 · 2017-01-11T07:37:21Z

The values enter in registers but they will be spilled to the arg slots if they are live at any call. So all live arguments will still be correct. Even modified live arguments should work (except for a few weird corner cases where multiple values derived from the same input are live).

Dead arguments are not correct even today. What you see in tracebacks is stale. They may get more wrong with this implementation, so it may be worth marking/not displaying dead args.

Things will get trickier if we allow callee-saved registers. We're thinking about it, but it isn't in the design doc yet.

minux · 2017-01-11T08:11:16Z

Just passing arguments in register while still reserving stack slots for them looks like MS's ABI. Isn't that negating a lot of the benefit of passing arguments in registers in the first place? Passing argument in register is precisely about not generating memory traffic for register arguments. Anyway, I'd like to read the full design docs of this and see the discussion of various design decisions and trade offs made.

randall77 · 2017-01-11T15:21:58Z

When we pass an argument in a register, the corresponding stack slot is reserved but not initialized, so there's no memory traffic for it. Only if the callee needs to spill it will that slot actually get initialized.

dr2chase · 2017-01-11T16:06:59Z

I'm not sure how "full" the design docs are.
I've got a CL up, and here's a more readable version of it that I'll try to keep up to date: https://gist.github.com/dr2chase/5a1107998024c76de22e122ed836562d

(Repeating some of that gist) I reviewed a bunch of ABIs, and the combination of

need someplace to spill if the stack must grow, and we have no frame yet
callee-saves makes tracing stacks for GC a good deal more painful; the impact on assembly language is also a good deal larger.
there's no memory traffic unless there's a spill, and then we must spill somewhere
most of the existing ABIs (all but Arm64) are unfriendly to multiple return values in registers
assurances from debugger/dump-inspection people (Delve and Backtrace.io) that they use DWARF, if we take the time to get it right

caused me to to decide to try something other than standard ABI(s). The new goal was to minimize change/cost while still getting plausible benefits, so I decided to keep as much of the existing design as possible.

The main loss in reserving stack space is increased stack size; we only spill if we must, but if we must, we spill to the "old" location.

As far as backtraces go, we need to improve our DWARF output for debuggers, and I think we will, and then use that to produce readable backtraces. That should get this as right as it can be (up to variables not being live) and would be more generally accessible than binary data.

So actually turning this on may be gated by DWARF improvements.

minux · 2017-01-11T20:49:19Z

Because we still spill arguments to stack when it's live across calls, this means using registered parameters won't help performance for most non-leaf functions (and any functions with moderate size.) Therefore, it seems registered parameters will mostly help small leaf functions. And if that's true, I imagine inlining those functions will actually provide even more speedup because then the compiler can optimize across function call boundaries. I'm not sure improving DWARF would help traceback. True, DWRAF provides an elaborated way to specify location of arguments (and variables), but for the case of traceback, we can only use SP relative addressing for the arguments: even though DWARF allows us to describe that at certain point in the program, certain argument is in a certain register, at the time of traceback, such information is useless because registers are almost guaranteed to be clobbered. What I'd like to see is concrete result that changing ABI is worthy the effort as compared to, say, better inlining support. Specifically, I like to see evidence that registering arguments help large functions that cannot be inlined. If we have to change the ABI, I think allowing some callee-saved registers will actually provide more benefit. And we need to cache g in a register on amd64 (like the RISC architectures). I remember Russ proposed that we maintain some essentially callee-saved register to save g. We should investigate general callee-saved ABIs. Prior experience shows that introducing callee-saved registers in a register-rich architecture helps performance considerably, and it's not restricted to small functions. Additionally, if we can align our callee-saved registers with the platform ABI, cgo callbacks will be faster because we always save all platform callee-saved registers at entry.

bcmills · 2017-01-12T00:17:53Z

if we can align our callee-saved registers with the platform ABI, cgo callbacks will be faster

Not to mention more reliable. A lot of the signal trampolines in the runtime could be a whole lot simpler if they didn't have to deal with the ABI mismatch between signal handlers (which must use the platform ABI) and the runtime's Go functions. (I'm still not entirely convinced that there aren't more calling-convention bugs lurking.)

minux · 2017-01-12T00:39:19Z

One problem of callee-saved registers is that we must figure out how to make GC cope with it. Otherwise if we can only save non-pointer in callee-saved registers across function calls, the benefit will be much smaller. This is complicated by the problem that when the callee decided to spill a callee-saved register, how do they know if it should be stored in pointer slots or not? Pointer tagging is out of the question as we don't box everything. My initial idea is having the caller reserves some stack space above the outgoing arguments area for all callee-saved registers and have (conditional) stack map for those. The callee will always save to designated slot for the register. To avoid clearing the save area, we need more pcdata to tell GC which of the callee-saved registers have been spilled to their respective slots. This solution does negate some of the benefit of having callee-saved registers, but coupled with better inlining, I think it could still outperform register arguments alone. Of course, one simple solution is to allow only pointers in callee-saved registers.

ianlancetaylor · 2017-01-12T01:35:18Z

At each call, have the PCDATA for that call record, for each callee-saved register, whether it holds 1) a pointer; 2) a non-pointer; 3) whatever it held on function entry. Also at each call, record the list of callee-saved registers saved from the caller, and where they are.

Now, on stack unwind, you can see the list of callee-saved registers. Then you look to the caller to see whether the value is a pointer or not. Of course you may have to look at the caller's caller, etc., but since the number of callee-saved registers is small and fixed it's easy to keep track of what you need to do as you walk up the stack.

philhofer · 2017-01-12T02:39:39Z

5% for amd64 doesn't feel like it justifies the cost of breaking all the assembly written to date.

Much like SSA, I expect this change will have a much more meaningful effect on minority architectures.

Those fancy Intel chips can resolve push/pop pairs halfway through the instruction pipeline (without even emitting a uop!), while a load from the stack on a Cortex A57 has a 4-cycle result latency..

minux · 2017-01-12T04:23:24Z

On Wed, Jan 11, 2017 at 9:39 PM, Philip Hofer ***@***.***> wrote: 5% for amd64 doesn't feel like it justifies the cost of breaking all the assembly written to date. Much like SSA, I expect this change will have a much more meaningful effect on minority architectures. Those fancy Intel chips can resolve push/pop pairs halfway through the instruction pipeline (without even emitting a uop!), while a load from the stack on a Cortex A57 has a 4-cycle result latency. <http://infocenter.arm.com/help/topic/com.arm.doc.uan0015b/Cortex_A57_Software_Optimization_Guide_external.pdf> .

Note: the gc compiler don't ever generate push or pop instructions. It always uses mov to access the stack. Result latency doesn't matter much on a out-of-order core because L1D on Intel chips has similar latency. I suspect the benefit could be higher, but for non-leaf functions we still need to spill the argument registers to stack, which negates most of the benefit for large and non-leaf functions.

philhofer · 2017-01-12T04:59:29Z

Result latency doesn't matter much on a out-of-order core because L1D on
Intel
chips has similar latency.

It especially doesn't matter if you have shadow registers, but AFAIK none of the arm/arm64 designs have 'em.

I suspect the benefit could be higher, but for non-leaf functions we still
need to
spill the argument registers to stack, which negates most of the benefit
for large
and non-leaf functions.

You can't just move them to callee-saves if they're still live? Is that for GC, or a requirement for traceback?

minux · 2017-01-12T05:04:05Z

On Wed, Jan 11, 2017 at 11:59 PM, Philip Hofer ***@***.***> wrote: I suspect the benefit could be higher, but for non-leaf functions we still need to spill the argument registers to stack, which negates most of the benefit for large and non-leaf functions. You can't just move them to callee-saves if they're still live? Is that for GC, or a requirement for traceback?

In the current ABI, every register is caller-save. My counterproposal is introducing callee-save registers but keep arguments passing on stack to preserve the current traceback behavior.

philhofer · 2017-01-12T05:11:51Z

In the current ABI, every register is caller-save. My counterproposal is
introducing callee-save
registers but keep arguments passing on stack to preserve the current
traceback behavior.

Ah. I guess I assumed, incorrectly, that this proposal included making some registers callee-save.

cherrymui · 2017-01-12T16:59:15Z

Therefore, it seems registered parameters will mostly help small leaf
functions. And if that's true, I imagine inlining those functions will
actually provide even more speedup because then the compiler can optimize
across function call boundaries.

Inlining doesn't help on function pointer calls.

bcmills · 2017-01-12T17:34:24Z

@dr2chase

there's no memory traffic unless there's a spill, and then we must spill somewhere

That's not strictly true. Empty stack slots still decrease cache locality by forcing the actual in-use stack across a larger number of cache lines. There's no extra memory traffic between the CPU and cache unless there's a spill, but reserving stack slots may well increase memory traffic on the bus.

most of the existing ABIs (all but Arm64) are unfriendly to multiple return values in registers

Not true. It would be trivial to extend the SysV AMD64 ABI for multiple return values, for example: it already has two return registers (rax and rdx), and it's easy enough to define extensions of the ABI that pass additional Go return values in other caller-save registers.

For example, we could do something like:

rax: varargs count; 1st return
rbx: callee-saved
rcx: 4th argument; 3rd return
rdx: 3rd argument; 2nd return
rsp: stack pointer
rbp: frame pointer
rsi: 2nd argument
rdi: 1st argument
r8: 5th argument; 4th return
r9: 6th argument; 5th return
r11: temporary register
r12-r14: callee-saved
r15: GOT base pointer

mundaym · 2017-01-13T18:26:32Z

Am I understanding correctly that this proposal is calling for structs to always be unpacked into registers before a call? Has any thought been given to passing structs as read-only references? I think this is how most (all?) of the ELF ABIs handle structs, particularly larger ones.

This way the callee gets to decide whether it needs to create a local copy of anything, avoiding copying in many cases. It is also presumably fairly common for only a subset of struct fields to be accessed so unpacking all or part of the struct may be unecessarily expensive (particularly if the struct has boolean fields). Obviously the reference would have to be to something on the stack (copied there, if necessary, by the caller) or to read-only memory.

For Go in particular it seems like this would be a nice fit because the only way to get const-like behaviour is to pass structs by value and internally passing them by reference instead would potentially make that idiom a lot cheaper.

Arrays and maybe slices could also be handled the same way.

minux · 2017-01-13T18:36:59Z

On Jan 12, 2017 11:59 AM, "cherrymui" <[email protected]> wrote: Therefore, it seems registered parameters will mostly help small leaf functions. And if that's true, I imagine inlining those functions will actually provide even more speedup because then the compiler can optimize across function call boundaries. Inlining doesn't help on function pointer calls. Yes, but I doubt passing arguments help much either (at least in the current proposal where it might still spill to memory). A better fix could be de-virtualization or speculation. On the other hanf, having callee-saved saved registers could help more (we can have more callee-saved registers that most function have arguments.)

randall77 · 2017-01-19T23:10:17Z

That the proposals conflict is ok. And we can discuss those conflicts here. But without a concrete proposal about how you'd like to make tracebacks better it's hard to see exactly what those conflicts would be.

aarzilli · 2017-02-03T20:03:27Z

I'm interested in knowing how this would interact with non-optimized compilation. Right now non-optimized compilations registerize sparingly, which is great for debuggers since the go compiler isn't good at saving informations about registerization. Putting a lot more things into registers without getting better at writing the appropriate debug symbols would be bad.

bcmills · 2017-02-03T20:08:47Z

I'm interested in knowing how this would interact with non-optimized compilation. Right now non-optimized compilations registerize sparingly, which is great for debuggers since the go compiler isn't good at saving informations about registerization.

Presumably part of the changes for the calling convention would be making the compiler emit more accurate DWARF info for register parameters.

rsc · 2017-04-03T20:53:57Z

On hold until @dr2chase is ready to proceed.

dr2chase · 2017-04-03T21:27:53Z

Current plan is 1.10, time got reallocated to loop unrolling and whatever help was required to get the Dwarf support better in general, so that we have the ability to say what we're doing with parameters.

There's been a lot of churn in the compiler in the last couple of months -- new source position, pushing the AST through, making things more ready for running in parallel, and moving liveness into the SSA form -- so I am okay with waiting a release.

smasher164 · 2017-10-31T02:58:13Z

Would the proposed calling convention omit frame pointers for leaf functions? I can see this varying based on whether the function stores the state of a callee-save register, takes the address of a local variable, etc. In that case, is there still a feasible way to obtain the call stack?

wkhere · 2018-01-06T02:22:19Z

A naive proposal:

maybe a good optimization would be, instead of changing calling convention, to focus on eliminating local variables on stack by using registers, and sharing as much as possible the stack frame space between variables and arguments/return values of subcalls.

This way (1) registers would speed up things, (2) stack frames would be smaller, (3) existing calling convention would be preserved.

davecheney · 2018-01-06T04:28:15Z

How would that work when a function has more than one caller?

…

On 6 Jan 2018, at 13:22, Wojciech Kaczmarek ***@***.***> wrote: A naive proposal: maybe a good optimization would be, instead of changing calling convention, to focus on eliminating local variables on stack by using registers, and sharing as much as possible the stack frame space between variables and arguments/return values of subcalls. This way (1) registers would speed up things, (2) stack frames would be smaller, (3) existing calling convention would be preserved. — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

quasilyte · 2018-02-07T21:09:19Z

I can be wrong, but register-based calling convention (CC) could give more performance boost if additional SSA rules are added that actually take advantage of it.

Currently, a big amount of small and average functions have many mem->reg->mem patterns that make certain optimizations on amd64 not worth it.

It is not fair to say that 10% is not significant.
It's 10% with optimizer that works on GOARCH=amd64 like it is GOARCH=386.
The potential gain is higher.

Lately, I was comparing 6g with gccgo and even without machine-dependent optimizations, aggressive inlining and constant folding there was about 25-40% performance difference in some allocation-free benchmarks. I do believe that this difference includes CC impact (because most other parts of output machine code look nearly the same).

laboger · 2018-02-08T16:56:16Z

As noted above, the performance benefit for a register-based calling convention is higher on RISC architectures like ppc64le & ppc64. If the above CL is not stale I would be willing to try it out. To me this is one of the biggest performance issues with golang on ppc64le.

Inlining does help but doesn't handle all cases if there are conditions that inhibit inlining. Also functions written in asm can't be inlined AFAIK.

dr2chase · 2018-02-08T23:43:54Z

The CL is very stale. First we have to get to a good place with debugger information (seems very likely for 1.10 [oops, 1.11]), and then we have to redo some of the experiment (it will go more quickly this time) with a better set of benchmarks. Some optimizations that looked promising on the go1 benchmarks turn out to look considerably less profitable when run against a wider set of benchmarks.

as · 2018-02-09T01:48:54Z

@dr2chase

Is the method used to determine the 5% performance improvement somewhere in the discussion? I'm curiously interested in what the function sample space actually looks like.

laboger · 2018-02-09T15:16:50Z

I think the measurement we want is how much does this help once inlining is fully enabled by default.

dr2chase · 2018-02-09T15:29:27Z

The estimate was based on counting call frequencies, looking at performance improvement on a small number of popular calls "registerized", and extrapolating that to the full number of calls.

I agree that we want to try better inlining first (it's in the queue for 1.11, now that we have that debugging information fixed), since that will cut the call count (and thus reduce the benefit) of this somewhat more complicated optimization. Mid-stack inlining, however, was one of the optimizations that looked a lot less good when applied to the larger suite of benchmarks. One problem with the larger benchmark suite is selection effect -- anyone who wrote a benchmark for parts of their application cares enough about a performance to write that benchmark, and has probably used it already to hand-optimize around rough spots in the Go implementation, so we'll see reduced gains from optimizing those rough spots.

CAFxX · 2018-02-10T00:31:37Z

and has probably used it already to hand-optimize around rough spots in the Go implementation

From my point of view, the goal of having a better inliner is to have more readable/maintainable (less "hand-optimized") code, not really faster code. So that we can build code addressing the problems we're trying to solve, not the shortcomings of the compiler.

ianlancetaylor · 2020-08-12T16:25:26Z

There is an updated proposal at #40724, which addresses many of the issues raised here. Closing this proposal in favor of that one.

dr2chase added this to the Go1.9Maybe milestone Jan 10, 2017

dr2chase self-assigned this Jan 10, 2017

cespare added the Proposal label Jan 10, 2017

dsnet modified the milestones: Proposal, Go1.9Maybe Jan 10, 2017

rsc changed the title ~~proposal: function and method arguments and results should be passed in registers~~ proposal: cmd/compile: define register-based calling convention Jan 23, 2017

kelleyk mentioned this issue Feb 13, 2017

trace.py support for args with Go's gc compiler iovisor/bcc#934

Open

rsc added the Proposal-Hold label Apr 3, 2017

bcmills mentioned this issue Aug 30, 2017

runtime: provide a mechanism for debuggers to execute a function in the program being debugged #21678

Closed

aclements mentioned this issue Sep 6, 2018

proposal: runtime: make the ABI undefined as a step toward changing it #27539

Closed

robaho mentioned this issue Dec 14, 2018

cmd/compile: improve interface dispatch performance #29276

Open

aclements mentioned this issue Apr 1, 2019

cmd/compile: improve inlining cost model #17566

Open

randall77 mentioned this issue Apr 23, 2020

cmd/compile: huge waste stack when make function calls #38588

Closed

at15 mentioned this issue Jul 21, 2020

[post] Function calling convention in C, Go, Rust and OS dyweb/blog#55

Open

aclements mentioned this issue Aug 12, 2020

cmd/compile: switch to a register-based calling convention for Go functions #40724

Open

ianlancetaylor closed this as completed Aug 12, 2020

golang locked and limited conversation to collaborators Nov 13, 2021

gopherbot added the FrozenDueToAge label Nov 13, 2021

rsc unassigned dr2chase Jun 23, 2022

proposal: cmd/compile: define register-based calling convention #18597

proposal: cmd/compile: define register-based calling convention #18597

Comments

dr2chase commented Jan 10, 2017 • edited Loading

(This is a placeholder bug for a design doc.)

gopherbot commented Jan 10, 2017

davecheney commented Jan 11, 2017 • edited Loading

minux commented Jan 11, 2017 via email

davecheney commented Jan 11, 2017

josharian commented Jan 11, 2017

minux commented Jan 11, 2017 via email

randall77 commented Jan 11, 2017

minux commented Jan 11, 2017 via email

randall77 commented Jan 11, 2017

minux commented Jan 11, 2017 via email

randall77 commented Jan 11, 2017

dr2chase commented Jan 11, 2017

minux commented Jan 11, 2017 via email

bcmills commented Jan 12, 2017

minux commented Jan 12, 2017 via email

ianlancetaylor commented Jan 12, 2017

philhofer commented Jan 12, 2017

minux commented Jan 12, 2017 via email

philhofer commented Jan 12, 2017

minux commented Jan 12, 2017 via email

philhofer commented Jan 12, 2017

cherrymui commented Jan 12, 2017

bcmills commented Jan 12, 2017 • edited Loading

mundaym commented Jan 13, 2017 • edited Loading

minux commented Jan 13, 2017 via email

randall77 commented Jan 19, 2017

aarzilli commented Feb 3, 2017

bcmills commented Feb 3, 2017

rsc commented Apr 3, 2017

dr2chase commented Apr 3, 2017

smasher164 commented Oct 31, 2017

wkhere commented Jan 6, 2018

davecheney commented Jan 6, 2018 via email

quasilyte commented Feb 7, 2018 • edited Loading

laboger commented Feb 8, 2018

dr2chase commented Feb 8, 2018 • edited Loading

as commented Feb 9, 2018 • edited Loading

laboger commented Feb 9, 2018

dr2chase commented Feb 9, 2018

CAFxX commented Feb 10, 2018 • edited Loading

ianlancetaylor commented Aug 12, 2020

dr2chase commented Jan 10, 2017 •

edited

Loading

davecheney commented Jan 11, 2017 •

edited

Loading

bcmills commented Jan 12, 2017 •

edited

Loading

mundaym commented Jan 13, 2017 •

edited

Loading

quasilyte commented Feb 7, 2018 •

edited

Loading

dr2chase commented Feb 8, 2018 •

edited

Loading

as commented Feb 9, 2018 •

edited

Loading

CAFxX commented Feb 10, 2018 •

edited

Loading