[FEA] Pass cudaStreamPerThread to numba/CuPy kernels #5922
Memory allocations should already use PTDS since both numba and CuPy allocate memory using RMM. Kernels, on the other hand, may explicitly need to be passed the cudaStreamPerThread stream handle.

cc: @jakirkham @kkraus14
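A minimal sketch (not from the original thread) of what explicitly passing the per-thread default stream to a Numba kernel could look like, assuming Numba's `cuda.per_thread_default_stream()` helper; the kernel and launch sizes are only illustrative:

```python
# Minimal sketch: launch a Numba kernel on the per-thread default stream
# instead of the legacy default stream. cuda.per_thread_default_stream()
# wraps CUDA's cudaStreamPerThread handle.
import numpy as np
from numba import cuda

@cuda.jit
def add_one(x):
    i = cuda.grid(1)
    if i < x.size:
        x[i] += 1

arr = cuda.to_device(np.zeros(1024, dtype=np.float32))

# Launch configuration is [blocks, threads_per_block, stream]; the third
# element selects the stream the kernel is enqueued on.
ptds = cuda.per_thread_default_stream()
add_one[4, 256, ptds](arr)
ptds.synchronize()
```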
Comments

Some relevant issues listed below:
xref: rapidsai/dask-cuda#96

Also some relevant PRs below:
xref: rapidsai/rmm#480
This issue has been marked rotten due to no recent activity in the past 90d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.
cc @pentschev (for awareness)
@shwina Is this intended to be unconditional, or only when requested by the user? We will also need to start passing streams down to libcudf everywhere in order to support this.
My take on this is that we could begin with an opt-in solution, like we've done for CuPy with the CUPY_CUDA_PER_THREAD_DEFAULT_STREAM environment variable.
Yeah, agreed. We probably want to support both the current mode and PTDS for some time; there are likely different issues we will run into, so having an escape hatch back to the old behavior is quite useful.
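A minimal sketch of what such an environment-variable opt-in could look like; CUPY_CUDA_PER_THREAD_DEFAULT_STREAM is CuPy's existing switch, while the cudf-side variable name below is hypothetical:

```python
# Sketch of an environment-variable opt-in for PTDS. The cudf variable name
# (CUDF_PER_THREAD_DEFAULT_STREAM) is hypothetical and only illustrates the
# "escape hatch" idea of keeping the current behavior as the default.
import os

# CuPy's existing opt-in: enable the per-thread default stream
# (set before importing CuPy).
os.environ["CUPY_CUDA_PER_THREAD_DEFAULT_STREAM"] = "1"
import cupy as cp

# A cudf-side opt-in could follow the same pattern, staying off by default.
use_ptds = os.environ.get("CUDF_PER_THREAD_DEFAULT_STREAM", "0") == "1"
```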