cmd/compile: record and use per-function optimization data #25999

josharian · 2018-06-21T17:47:22Z

This is an open-ended performance idea to explore.

We have per-function compiler-specific export data (see method funcExt in iexport.go). We could do some analysis in package ssa of return values, add that to the export data, and use it on import. This might help with function calls that cannot be inlined.

For example, we could record whether a return value is known to be non-nil. Or a known limited range for a return value. Or a concrete type for a function that returns an interface.
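A minimal sketch of the three kinds of facts mentioned above, to make them concrete. The names `node`, `newNode`, `bucket`, and `newBuffer` are hypothetical and exist only for this illustration; `//go:noinline` stands in for "a call that cannot be inlined". This shows what a caller could in principle exploit, not what the compiler does today.

```go
package main

import (
	"bytes"
	"fmt"
	"io"
)

// node is a hypothetical type used only for this illustration.
type node struct{ val int }

// newNode always returns a non-nil pointer. If that fact were recorded in
// export data, a caller could skip the nil check before dereferencing.
//
//go:noinline
func newNode(v int) *node {
	return &node{val: v}
}

// bucket always returns a value in [0, 8). A caller indexing an array of
// length 8 with the result could, in principle, drop the bounds check.
//
//go:noinline
func bucket(h uint32) int {
	return int(h & 7)
}

// newBuffer is declared to return io.Writer but always returns a
// *bytes.Buffer. Knowing the concrete type could let a caller
// devirtualize method calls on the result.
//
//go:noinline
func newBuffer() io.Writer {
	return new(bytes.Buffer)
}

func main() {
	var counts [8]int
	n := newNode(42)
	// The dereference of n and the index into counts are the candidates
	// for nil-check and bounds-check elimination.
	counts[bucket(uint32(n.val))]++
	// The Write call made through w is the candidate for devirtualization.
	w := newBuffer()
	fmt.Fprintf(w, "%d", n.val)
	fmt.Println(counts, w.(*bytes.Buffer).String())
}
```

In each case the property is visible inside the callee's body but is lost at the call boundary unless something like export data carries it across.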

Related: #25862. If we pursue this idea, the mechanism for using info about the return values could be re-used with some annotation for #25862. And possibly also to generalize and simplify the ssa rule "don't nilcheck the return value of newobject".

We could also return things like the ratio of Nodes to instructions, which we might want to use to improve downstream inlining decisions.

One weird thing about doing this is that calls to functions from imported packages might be optimized further than calls to functions within the same package. (A similar problem arises for some of the ideas mooted in #17566.) Still probably worth exploring.

cc @randall77 @martisch @dr2chase @cherrymui

TocarIP · 2018-06-21T18:14:59Z

Sounds interesting! We could also record used registers and avoid spills of unclobbered registers.

josharian · 2018-06-21T18:18:33Z

@TocarIP oooooooooooooooh.

dr2chase · 2018-06-21T18:25:50Z

This has minor implications for the GC, though it could be arranged that pointers are always spilled, integers perhaps not.

CAFxX · 2018-06-27T21:53:00Z

Would it be possible to also propagate stack usage (if known/constant), so that the caller can call morestack once (to ensure there's enough for both itself and the callees) and the callees can omit the morestack check/call?

dr2chase · 2018-06-27T22:04:42Z

@CAFxX That becomes possible when goroutine preemption can be done with signals. This is in the pipeline, and with luck will emerge in 1.12. This has some interaction with how the runtime/GC handles stack resizing: if F calls G, which calls H, then F is responsible for all of their frames. If the GC ran during G and shrank the stack to what was adequate for G alone, a subsequent call to H might overrun the stack. So we'd need to work on some sort of a handshake or marker to prevent this.
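As a rough illustration of the F/G/H case, here is a toy model of how a combined stack requirement could be derived from per-function data. The `fn` type, the `maxStack` helper, and the frame sizes are all hypothetical; this is not how the compiler or runtime represents frames, and it ignores the stack-shrinking interaction described above.

```go
package main

import "fmt"

// fn is a hypothetical per-function summary: the function's own frame size
// in bytes and the names of the functions it calls directly.
type fn struct {
	frame   int
	callees []string
}

// maxStack returns the worst-case number of stack bytes needed to run name
// and everything reachable from it. It reports ok=false when the depth is
// not statically known: a callee is missing from g, or the chain recurses.
func maxStack(g map[string]fn, name string, onPath map[string]bool) (size int, ok bool) {
	f, found := g[name]
	if !found || onPath[name] {
		return 0, false
	}
	onPath[name] = true
	defer delete(onPath, name)

	deepest := 0
	for _, c := range f.callees {
		sub, subOK := maxStack(g, c, onPath)
		if !subOK {
			return 0, false
		}
		if sub > deepest {
			deepest = sub
		}
	}
	return f.frame + deepest, true
}

func main() {
	// F calls G, which calls H. Frame sizes are made up.
	g := map[string]fn{
		"F": {frame: 64, callees: []string{"G"}},
		"G": {frame: 32, callees: []string{"H"}},
		"H": {frame: 128},
	}
	size, ok := maxStack(g, "F", map[string]bool{})
	fmt.Println(size, ok)
}
```

With a known bound like this, F's prologue could check for the combined size once, and G and H could omit their own checks, provided the stack is never shrunk below that bound while F's callees are still pending.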

josharian added the Performance label Jun 21, 2018

josharian added this to the Unplanned milestone Jun 21, 2018

josharian mentioned this issue Jul 22, 2018

cmd/compile: optimise away deferred calls to empty functions #26534

Open

martisch mentioned this issue Aug 23, 2018

cmd/compile: optimise ranges over string(byteSlice) #27148

Open

CAFxX mentioned this issue Sep 7, 2018

runtime: use parent goroutine's stack for new goroutines #27345

Open

josharian mentioned this issue Oct 9, 2018

cmd/go: only rebuild dependent packages when export data has changed #15752

Open

josharian mentioned this issue Oct 23, 2018

cmd/compile: recognize rand.Intn() is bounded for BCE #28314

Open

CAFxX mentioned this issue Dec 3, 2018

cmd/compile: minimize morestack calls text footprint #29067

Open

josharian mentioned this issue Mar 31, 2019

cmd/compile: improve inlining cost model #17566

Open

ALTree added the NeedsInvestigation label Oct 12, 2020

gopherbot added the compiler/runtime label Jul 13, 2022

mknyszek added this to Go Compiler / Runtime Jul 13, 2022

mknyszek removed this from Go Compiler / Runtime Jul 13, 2022
