gh-97696: DRAFT asyncio eager tasks factory prototype #101613

itamaro · 2023-02-06T20:01:00Z

based on GH-98137 + feedback left there

still much left to do here, but something is working, so wanted to get early feedback!

with an opt build and the checked-in benchmarking script:

baseline (no eagerness)

./python.exe async_tree.py -s no_suspension -p
3.12.0a4+ (heads/gather-early-return-dirty:a2a7caf48d, Feb  3 2023, 16:14:11) [Clang 12.0.0 (clang-1200.0.32.28)]
/Users/itamaro/work/pyexe/main-opt/python.exe
Scenario: no_suspension
Time: 1.1034402861259878 s
Tasks created: 55989
Suspense called: 0

eager is ~2x faster in this scenario:

./python.exe async_tree.py -s no_suspension -p -e
3.12.0a4+ (heads/gather-early-return-dirty:a2a7caf48d, Feb  3 2023, 16:14:11) [Clang 12.0.0 (clang-1200.0.32.28)]
/Users/itamaro/work/pyexe/main-opt/python.exe
Scenario: no_suspension
Time: 0.5100075129885226 s
Tasks created: 3
Suspense called: 0

TODOs remaining:

to discuss:

do we want the task_factory arg I added to the runner? in this or a separate PR?
we are still leaving perf on the table by creating tasks and linking them to a gathering future in gather even when all future already completed. maybe something similar applies to taskgroups too, I didn't look yet. we can handle the case that all futures are completed to return synchronously (in a followup PR). any reason not to do it?
even more perf by reimplementing things in C (like gather)

Issue: Add eager task creation API to asyncio #97696

…ng to get it working

…ing on flags and feature availability

DinoV · 2023-02-14T01:03:48Z

Lib/asyncio/runners.py

@@ -45,10 +45,11 @@ class Runner:

    # Note: the class is final, it is not intended for inheritance.

-    def __init__(self, *, debug=None, loop_factory=None):
+    def __init__(self, *, debug=None, loop_factory=None, task_factory=None):


Do we really need a separate task_factory here? Ultimately this is just influencing the loop after it's created, couldn't that be rolled into the existing loop_factory mechanism?

no we don't need this. it was convenient to add it for my testing. sure, the same result can be achieved with the loop factory, but this feels cleaner, so I figured I'd suggest it and see what others think

Lib/asyncio/tasks.py

DinoV · 2023-02-14T01:15:13Z

Modules/_asynciomodule.c

+    }
+    else {
+        Py_INCREF(coro_result);
+        task_step2_impl(state, self, coro_result);


The return value should be checked here, I assume if it's NULL we'll want to error out. But there should also presumably be a test case added which will hit that.

I'd like to find the test case that would trip on this first :)

DinoV · 2023-02-14T01:18:49Z

Modules/_asynciomodule.c

+        }
+    }
+    else {
+        Py_INCREF(coro_result);


This incref seems a little weird? The caller should be keeping this alive for the lifetime of the call, and if task_step2_impl wants to hold onto it then it seems it should do the inc ref.

I ran into refcnt assertions and adding this made them go away 😬 I don't have a stronger justification for doing it here, so I'll need to dig deeper

from reading the code, I think the issue is on line 2911 in task_step_handle_result_impl, where the result is handed off to the task and there's currently a comment that "no incref is necessary." That comment made sense when result was fully a local variable of task_step_impl, so it could just hand off its owned reference to the task. but now that code is wrong (or at least, is assuming task_step_handle_result_impl steals the reference to its result argument, which isn't the typical/best approach.) So I think adding this incref is actually one "correct" option (as in, the refcounts all end up correct), but it's probably not the clearest / most maintainable option. The better option might be to add the incref down in task_step_handle_result_impl where it gives a reference to the task, and then add a decref of result at the end of task_step_impl after calling task_step_handle_result_impl (since task_step_handle_result_impl will no longer be stealing that reference.)

good point, thanks @carljm !

tried moving the incref where you suggested and still ran into negative refcounts (in different cases though, like "different loop" test). it works if I incref right in the beginning of the function - we can go with that, or I can try make it more granular by chasing down all the branches that would need the incref the result

So currently the code in task_step_handle_result_impl was written with the assumption that it owns its reference to result. I didn't initially look down through its code far enough to see all the places that assumption manifests, but in addition to line 2911, there's also line 3004, and then below that there are a bunch of error exit cases that do Py_DECREF(result), which only makes sense under the assumption that the reference is owned.

So the choices are either to incref it right away, so that the assumption made by the rest of the existing code continues to be true, or to modify the code so it increfs on line 2911 and 3004 (I think those are the only spots I see?) where it stores a new reference, and so it doesn't decref on all the various error exits at the end of the func.

The latter choice is more typical, because generally doing it more lazily results in fewer reference counting operations (i.e. in one of the error-exit cases, no refcount operations should be needed at all, but the "incref early" option means there's a pointless incref and then decref), and also because it's typically less error prone to future changes (instead of having to remember to decref result on every exit path that does nothing with result, which is harder to remember precisely because you aren't doing anything with result, you only have to incref on the paths where you actually do store a new reference to result, which is pretty natural. But it does mean more changes to the current code.

thanks for the explanation! super helpful :)
the last commit I just pushed (adding 2 increfs and removing a bunch of decrefs) seems to be working!

rename "step2", add TODOs to address before this is ready

itamaro added 6 commits February 3, 2023 16:51

Copy pythonGH-98137, make some changes, add benchmarking script, tryi…

1ea845c

…ng to get it working

asyncio runner takes optional task factory arg

de6d910

don't coro.send if the event loop is not running (yet)

56610e7

modify async tree script to support eager factory

34082a7

don't over-count tasks that yield result immediately

ce5beaf

handle task._source_traceback in eager task factory

38d7b0b

bedevere-bot added the awaiting review label Feb 6, 2023

bedevere-bot mentioned this pull request Feb 6, 2023

Add eager task creation API to asyncio #97696

Closed

arhadthedev added topic-asyncio stdlib Python modules in the Lib dir labels Feb 6, 2023

arhadthedev changed the title ~~[GH-97696] DRAFT asyncio eager tasks factory prototype~~ gh-97696: DRAFT asyncio eager tasks factory prototype Feb 6, 2023

itamaro added 5 commits February 10, 2023 13:47

stop checking for eager factory in taskgroups

6afbaab

refactor async tree benchmark to work with TaskGroup or gather depend…

a6e68bc

…ing on flags and feature availability

restore C task

464bd49

yield_result -> coro_result

ac26ad6

Support coro_result in Task C impl

ae2dcf3

itamaro force-pushed the asyncio-eager-tasks-playground branch from fa7c13e to ae2dcf3 Compare February 11, 2023 18:09

DinoV reviewed Feb 14, 2023

View reviewed changes

Lib/asyncio/tasks.py Outdated Show resolved Hide resolved

DinoV reviewed Feb 14, 2023

View reviewed changes

itamaro added 9 commits February 22, 2023 09:33

Merge branch 'main' into asyncio-eager-tasks-playground

9794715

Address Dino's review comments

0b447b0

rename "step2", add TODOs to address before this is ready

passthrough coro_result from custom constructor only if it is set

f2748e2

add != NULL assertion on step2 result

61ac5d0

Merge branch 'main' into asyncio-eager-tasks-playground

cbd14fb

fix result refcnting in task_step_handle_result_impl

3503337

Add eager task factory tests

49c0e89

cleanup eager task factory tests

8a5229a

add news blurb

d1e5fd1

itamaro added 3 commits March 15, 2023 12:19

Merge branch 'main' into asyncio-eager-tasks-playground

580fb1f

apply patchcheck fixes in asyncio.tasks

6284c41

regenerate clinic

34123a7

itamaro closed this Apr 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-97696: DRAFT asyncio eager tasks factory prototype #101613

gh-97696: DRAFT asyncio eager tasks factory prototype #101613

itamaro commented Feb 6, 2023 •

edited

Loading

DinoV Feb 14, 2023

itamaro Feb 14, 2023

DinoV Feb 14, 2023

itamaro Feb 14, 2023

DinoV Feb 14, 2023

itamaro Feb 14, 2023

carljm Mar 2, 2023

itamaro Mar 7, 2023

carljm Mar 8, 2023

itamaro Mar 10, 2023

gh-97696: DRAFT asyncio eager tasks factory prototype #101613

gh-97696: DRAFT asyncio eager tasks factory prototype #101613

Conversation

itamaro commented Feb 6, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

itamaro commented Feb 6, 2023 •

edited

Loading