gh-111786: Use separate opcode vars for Tier 1 and Tier 2 #112289
This makes Windows about 3% faster on pyperformance benchmarks.
Nice, looks very straightforward.
Though based on our offline conversation, did you find out whether Windows PGO uses -E?
Python/ceval.c
uint8_t opcode; /* Current opcode */
int oparg; /* Current opcode argument, if any */
Heh, I would have kept the comments at the same column, so you don't appear to be editing the oparg line. :-)
Sure -- I think my editor did something magical.
I don't know, but I think that's orthogonal to this. Currently on the benchmarking infrastructure, we only pass the
Ah, sorry. Should I merge this then?
1% faster than without Tier 2, or 1% faster than main?
Yeah, I think that's fine.
When you turn Tier 2 on during the build (PGO collection), it's 1% faster than if you don't (using Tier 2 at runtime in both cases).
That's great news!
…on#112289) This makes Windows about 3% faster on pyperformance benchmarks.
Suggested by @neonene: #111786 (comment)
This makes Windows about 3% faster on pyperformance benchmarks. See analysis here