gh-114058: Foundations of the Tier2 redundancy eliminator #115085
Conversation
Co-Authored-By: Mark Shannon <[email protected]>
Co-Authored-By: Jules <[email protected]>
Co-Authored-By: Guido van Rossum <[email protected]>
This looks much closer to what I think we need. Thanks.
What's the API for _Py_UOpsSymType? It isn't clear what's internal and what's API.
    _Py_UOpsSymType **stack_pointer;
    _Py_UOpsSymType **stack;
    _Py_UOpsSymType **locals;
} _Py_UOpsAbstractFrame;
You can use much the same layout as an actual frame and put the locals and stack at the end:
...
_Py_UOpsSymType **stack_pointer;
_Py_UOpsSymType *localsplus[1];
}
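For illustration, here is how that suggested layout would read with locals and stack as views into localsplus, as in a real interpreter frame. The locals_len field and the derived views are assumptions added for this sketch, not part of the suggestion:

// Sketch only: frame-like layout where locals and stack share localsplus.
// Assumes _Py_UOpsSymType is declared elsewhere; locals_len is illustrative.
typedef struct _Py_UOpsAbstractFrame {
    int locals_len;                    // number of locals in this frame
    _Py_UOpsSymType **stack_pointer;   // top of the abstract stack
    _Py_UOpsSymType *localsplus[1];    // locals, immediately followed by the stack
} _Py_UOpsAbstractFrame;

// Locals and stack then fall out of localsplus, as in a real frame:
//   locals == frame->localsplus
//   stack  == frame->localsplus + frame->locals_len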
I initially did that, but this new layout is easier for inlining, because it actually models the state if all frames were to be inlined. So it makes things like calculating the new locals offset easier.
But we aren't inlining yet, and we might choose a different approach. Making it a single struct would make memory management simpler.
Right now, making it a single struct would make memory management a little more complex, because my "frames" are just an array of statically sized structs. Their localsplus just indexes into a giant pool, and frame->prev is just index--. It's all rather clean, and I only need to manage memory once instead of for 7 giant frames.
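For context, a rough sketch of that pool-based arrangement. The context struct name, the sizes, and the curr_frame field are illustrative assumptions; only the frame fields come from the diff quoted earlier:

// Rough sketch of the pool scheme described above, reusing the
// _Py_UOpsAbstractFrame from the quoted diff.
#define MAX_ABSTRACT_FRAMES 7           // the "7 giant frames" mentioned above
#define POOL_ENTRIES (MAX_ABSTRACT_FRAMES * 64)

typedef struct {
    // One allocation covers every frame's locals and stack...
    _Py_UOpsSymType *pool[POOL_ENTRIES];
    // ...and the frames themselves are a statically sized array, so
    // moving to the previous frame is just curr_frame--.
    _Py_UOpsAbstractFrame frames[MAX_ABSTRACT_FRAMES];
    int curr_frame;
} _Py_UOpsAbstractInterpContext;

// Each frame's locals/stack pointers are set to slices of ctx->pool when the
// frame is pushed; nothing needs to be freed per frame.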
Ideally, we won't be doing any allocations, just reusing a buffer on the thread-state. But that's a future PR.
But we still need to manage the memory within that buffer, and monolithic frames will make that simpler.
This should be fine for now, though.
How hard would it be to change the API to take …?
Quite possible. I can just convert to the internal representation internally. (The representation is just to save space and support unions in the future.) The only thing is that there are some "types" that can't be represented using a PyTypeObject; in that case I will special-case them.
It is probably best to add those cases to the API.
Python/optimizer_analysis.c (Outdated)

#define GETLOCAL(idx) ((ctx->frame->locals[idx]))

#define REPLACE_OP(op, arg, oper) \
I don't like macros that assume context, and we rarely (maybe never) want to set the oparg or operand.
Suggested change: replace
#define REPLACE_OP(op, arg, oper) \
with
#define REPLACE_OP(INST, OP) (INST)->opcode = (OP)
We will need operand once we start burning in _LOAD_CONST_INLINE and friends.
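For illustration, one possible shape once operand is needed. The macro names and the _PyUOpInstruction fields used here are assumptions, not necessarily what the PR settled on:

// Illustrative only: make each replacement explicit, assuming
// _PyUOpInstruction exposes opcode, oparg and operand fields.
#define REPLACE_OPCODE(INST, OP)  ((INST)->opcode = (OP))

#define REPLACE_OP(INST, OP, ARG, OPERAND)  \
    do {                                    \
        (INST)->opcode  = (OP);             \
        (INST)->oparg   = (ARG);            \
        (INST)->operand = (OPERAND);        \
    } while (0)

// e.g. burning in a known constant:
//   REPLACE_OP(this_instr, _LOAD_CONST_INLINE, 0, (uintptr_t)value);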
static void
remove_unneeded_uops(_PyUOpInstruction *buffer, int buffer_size) |
Has this changed, or has it just moved?
I did not change it. It just moved.
Python/optimizer_analysis.c (Outdated)

// The only valid error we can raise is MemoryError.
// Other times it's not really errors but things like not being able
// to fetch a function version because the function got deleted.
return PyErr_Occurred() ? -1 : 0;
Why do you need to check PyErr_Occurred()? uop_redundancy_eliminator should not be returning -1 unless an error has occurred.
In the old code, constant propagation could raise MemoryError.
Looks good now.
This sets up the foundations of the tier 2 optimizer's redundancy eliminator.
It's basically a port of #114059 over to Mark's DSL. Constant propagation and a lot of the type propagation rules have been torn out for the sake of simpler code for now. We can easily add more later; I just want to get the foundations in.
I expect no speedups from this, since most of the code has been torn out to keep the review simpler. However, I believe #114059 already shows that this approach has potential for speedups.
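For readers unfamiliar with the tier 2 buffer, here is a toy illustration of the kind of pass this enables. This is not the PR's code; the single-flag "abstract state" is a deliberate simplification, and it assumes the internal uop definitions (_PyUOpInstruction, _GUARD_BOTH_INT, _NOP) are in scope:

// Toy example: walk the tier 2 buffer and NOP out guards that a trivial
// abstract state has already proven. The real pass tracks per-value
// information (the _Py_UOpsSymType values discussed above) instead of a
// single flag.
static void
eliminate_redundant_int_guards(_PyUOpInstruction *buffer, int buffer_size)
{
    int ints_checked = 0;   // have the operands already been proven ints?
    for (int i = 0; i < buffer_size; i++) {
        switch (buffer[i].opcode) {
            case _GUARD_BOTH_INT:
                if (ints_checked) {
                    buffer[i].opcode = _NOP;   // redundant guard, drop it
                }
                else {
                    ints_checked = 1;
                }
                break;
            default:
                // Anything we don't model invalidates what we know.
                ints_checked = 0;
                break;
        }
    }
}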