go/parser: isolate object resolution from parsing #45104

findleyr · 2021-03-18T16:03:00Z

This issue relates to an implementation detail of go/parser, but one that is fundamental enough to track independently.

Currently go/parser does a bit more than just parse and produce an AST: while parsing it creates *ast.Scopes and uses them to map identifiers to declarations (object resolution), storing the result of this resolution in ast.Ident.Obj, and reporting some declaration errors if, for example, there's no new identifier on the left side of ':='. There's a fair bit of overlap with go/types, which has its own more robust concept of scopes.

Furthermore:

identifier resolution is not really complete, missing e.g. struct fields or imported identifiers
based on some quick benchmarking, it takes somewhere from 20-40% of parsing time
being intermixed with parsing, it complicates the code, and makes it harder to align go/parser with cmd/compile/internal/syntax
since ident.Obj is used in some places, any alternative parser to go/parser must also implement the logic of object resolution (for example, @mvdan experimented with using the syntax parser and mapping to go/ast, and was blocked on this)
the test coverage for object resolution is limited

I'm working on improving go/parser error recovery (and to a lesser extent, performance), and it's proving to be easier if identifier resolution is moved out of the parsing pass into a separate post-processing pass (thanks @griesemer for the suggestion).

At this point I have a working version of go/parser that achieves this, albeit hacked-up so that I can run in both paradigms and compare the result. Here's the plan forward:

add additional tests for object resolution
finish extracting object resolution and label tracking from the parsing pass
add a new parser mode that allows skipping object resolution entirely if the caller knows it will not be needed (something like SkipResolution).
(maybe) expose object resolution as an independent API (something like func Resolve(*ast.Node, *ast.Scope), so that alternate parsers can use the same logic to enrich the AST with objects.

It's worth noting that gopls can't quite disable object resolution, as there are a few analyzers that use it.

The text was updated successfully, but these errors were encountered:

mvdan · 2021-03-18T16:09:51Z

I'd love this. I assume the vast majority of the Parse API users could skip object resolution. It would certainly help with syntax-only tools like gofmt and gofumpt.

I assume we can't change the default behavior because of the backwards compatibility guarantee.

Have you thought about go/packages? Since it supports go/types, I imagine practically noone should be relying on go/parser's object resolution.

findleyr · 2021-03-18T16:30:44Z

Have you thought about go/packages? Since it supports go/types, I imagine practically noone should be relying on go/parser's object resolution.

Not really. There's a pretty big gap between practically noone and noone. Given that parsing is generally a fraction of type checking (5-15% perhaps), I'm not sure it's worth skipping object resolution if it adds complexity to the API. However, for tools such as gofumpt (which needs only syntax) or maybe gopls, which manages parsing independently of go/packages, there is an opportunity for performance gains. We certainly can't change the default behavior for either the parser or go/packages.

Were the only benefit performance I'm not sure this would be worth doing. Also, in my prototype there appears to be a small but statistically significant cost of resolving in a post processing pass, simply due to the extra traversal -- maybe 2-5%. I'm sure I can make that cost up elsewhere if necessary :)

mvdan · 2021-03-18T18:55:14Z

Given that parsing is generally a fraction of type checking (5-15% perhaps), I'm not sure it's worth skipping object resolution if it adds complexity to the API

That's true, but it's also true that a good portion of go/packages users load syntax without loading type information. Whether or not it's worth adding API complexity is a fair point.

I'm sure I can make that cost up elsewhere if necessary :)

I'd even go as far as saying that a <10% slow-down is okay. Anywhere where the added cost will really matter can likely just enable the flag to skip object resolution and get a speed-up instead.

findleyr · 2021-03-22T21:20:18Z

Ok, now I've got everything matching the current behavior identically (after normalizing for bugs in the current implementation).
Quick update on timing (benchmarking with go/types/expr.go):

Disabling object resolution reduces parse time by 18.5%. The other half of the 40% number I was seeing earlier was, unfortunately, just a regression in 1.17 that was fixed by https://golang.org/cl/302629.
With no optimizations, resolving in a second pass through the AST increases parse time by 4%.
I have an optimization for object resolution that neutralizes this 4% slowdown, but it's somewhat complicated (it transposes the spaghetti stack to replace multiple map lookups with a single map lookup and slice scan). I don't think anyone will actually notice the 4% slowdown, so this probably isn't worth it.

CLs incoming, once I can reassemble these changes into a sensible chain.

mvdan · 2021-03-22T21:25:37Z

Happy to help review these, though I assume you want gri's approval on them anyway.

gopherbot · 2021-03-24T22:24:08Z

Change https://golang.org/cl/304455 mentions this issue: go/parser: switch to resolving objects as a post-processing pass

gopherbot · 2021-03-24T22:24:10Z

Change https://golang.org/cl/304453 mentions this issue: go/parser: add resolution tests for type params

gopherbot · 2021-03-24T22:24:10Z

Change https://golang.org/cl/304450 mentions this issue: go/parser: add data-driven tests for object resolution

findleyr · 2021-03-25T12:23:04Z

Happy to help review these, though I assume you want gri's approval on them anyway.

Feel free to review if you want, but yes I will require gri's approval :)

Add new tests for object resolution driven by source files with declarations and uses marked via special comments. This made it easier to add test coverage while refactoring object resolution for #45104. Tests are added to a new resolver_test.go file. In a subsequent CL the resolver.go file will be added, making this choice of file name more sensible. For #45104 For #45136 For #45160 Change-Id: I240fccc0de95aa8f2d03e39c77146d4c61f1ef9e Reviewed-on: https://go-review.googlesource.com/c/go/+/304450 Trust: Robert Findley <[email protected]> Trust: Robert Griesemer <[email protected]> Run-TryBot: Robert Findley <[email protected]> Reviewed-by: Robert Griesemer <[email protected]> TryBot-Result: Go Bot <[email protected]>

For #45104 For #45221 Change-Id: I8966555f4e8844d5b6766d00d48f7a81868ccf40 Reviewed-on: https://go-review.googlesource.com/c/go/+/304453 Trust: Robert Findley <[email protected]> Trust: Robert Griesemer <[email protected]> Run-TryBot: Robert Findley <[email protected]> TryBot-Result: Go Bot <[email protected]> Reviewed-by: Robert Griesemer <[email protected]>

Coupling object resolution to parsing complicates the parsing code, and is a barrier to improvement. It requires passing around context such as 'lhs' or 'keyOk', and even then sometimes requires guess-work, such as whether to resolve the key in a composite literal. In this CL we delay object resolution to a separate pass after the file parse completes. This makes it easier to see logic of scoping, and removes state from the parsing code. This can enable subsequent improvements such as optionally skipping object resolution, aligning the parser with cmd/compile/internal/syntax, and allowing alternative parsers to reuse object resolution. The additional AST traversal appears to slow down parsing by around 4%. That seems small enough not to worry about, especially since performance sensitive users may eventually be able to disable object resolution entirely, saving around 18% off the previous baseline. I'll also mail a speculative CL showing how we can significantly mitigate the cost of object resolution by transposing scopes. For #45104 Change-Id: I98d9143fd77ae29e84ec7c3ae2fdb1139510da37 Reviewed-on: https://go-review.googlesource.com/c/go/+/304455 Trust: Robert Findley <[email protected]> Trust: Robert Griesemer <[email protected]> Run-TryBot: Robert Findley <[email protected]> TryBot-Result: Go Bot <[email protected]> Reviewed-by: Robert Griesemer <[email protected]>

gopherbot · 2021-04-08T16:56:05Z

Change https://golang.org/cl/306149 mentions this issue: go/parser: add a SkipResolution method to bypass object resolution

findleyr · 2021-06-01T14:16:39Z

We had a bit of discussion on #46298, and the addition of SkipObjectResolution does warrant a proper proposal, which I've filed as #46485. Sorry, initially we thought this wasn't necessary for such a minor change.

I also think it could be beneficial to more formally consider the change. As I wrote in that proposal, I think it's a small benefit for smaller cost, but I'd like to hear from others. If the discussion is one-sided in favor of the new mode, we can perhaps still add it for 1.17, but I don't want to rush it.

cherrymui added the NeedsFix The path to resolution is known, but the work has not been done. label Mar 18, 2021

cherrymui added this to the Backlog milestone Mar 18, 2021

findleyr self-assigned this Mar 18, 2021

This was referenced Mar 19, 2021

go/parser: missing object for composite lit types identifiers beginning statements #45136

Closed

go/parser: composite lit fields resolve incorrectly #45160

Open

findleyr mentioned this issue Mar 24, 2021

go/parser: uses of function type params are not resolved in the signature #45221

Closed

gopherbot closed this as completed in 02a2ff4 Apr 16, 2021

ianlancetaylor mentioned this issue May 28, 2021

go/parser: back-out or document the SkipObjectResolution mode pending #46485 #46298

Closed

findleyr mentioned this issue Jun 1, 2021

go/parser: add a SkipObjectResolution mode flag #46485

Closed

rsc mentioned this issue Jun 10, 2021

api: audit for Go 1.17 #46688

Closed

83 tasks

findleyr mentioned this issue Sep 2, 2021

go/ast: clarify when Ident.Obj is nil and how to find the ident's Obj #48141

Closed

golang locked and limited conversation to collaborators Jun 1, 2022

gopherbot added the FrozenDueToAge label Jun 1, 2022

rsc unassigned findleyr Jun 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

go/parser: isolate object resolution from parsing #45104

go/parser: isolate object resolution from parsing #45104

findleyr commented Mar 18, 2021 •

edited

Loading

mvdan commented Mar 18, 2021

findleyr commented Mar 18, 2021

mvdan commented Mar 18, 2021

findleyr commented Mar 22, 2021

mvdan commented Mar 22, 2021

gopherbot commented Mar 24, 2021

gopherbot commented Mar 24, 2021

gopherbot commented Mar 24, 2021

findleyr commented Mar 25, 2021

gopherbot commented Apr 8, 2021

findleyr commented Jun 1, 2021

go/parser: isolate object resolution from parsing #45104

go/parser: isolate object resolution from parsing #45104

Comments

findleyr commented Mar 18, 2021 • edited Loading

mvdan commented Mar 18, 2021

findleyr commented Mar 18, 2021

mvdan commented Mar 18, 2021

findleyr commented Mar 22, 2021

mvdan commented Mar 22, 2021

gopherbot commented Mar 24, 2021

gopherbot commented Mar 24, 2021

gopherbot commented Mar 24, 2021

findleyr commented Mar 25, 2021

gopherbot commented Apr 8, 2021

findleyr commented Jun 1, 2021

findleyr commented Mar 18, 2021 •

edited

Loading