Allow users to use their own cuda contexts and streams in JIT mode #6345

abadams · 2021-10-23T00:23:37Z

By exposing the acquire_cuda_context and friends methods in the same way as other JIT runtime overrides, we can let people replace them by swapping in their own handlers for those methods (see the new test)

This PR is a bit of a can-kick, because it's a piecemeal function-pointer-based exposure of three new runtime methods, rather than a single coherent way to replace any parts of the runtime you want. It should solve the problem in the short term, however.

Builds on #6344

steven-johnson

Should we have something like jit_handlers().reset() to set all the handlers back to their defaults?

steven-johnson · 2021-10-25T17:01:57Z

src/JITModule.h

@@ -28,16 +28,130 @@ struct JITUserContext;

 /** A set of custom overrides of runtime functions */
 struct JITHandlers {
+    /** Set the function called to print messages from the runtime.
+     * If you are compiling statically, you can also just define your


The comments about "if you are compiling statically" are arguably confusing and/or misplaced here. I'd suggest replacing everything after the first sentence with something like "(Note that this applies only when jitting code; if you are doing ahead-of-time compilation, see [README_foo.md or wherever we document this, rather than trying to replicate that documentation here])".

Done (in underlying branch)

Should we have something like jit_handlers().reset() to set all the handlers back to their defaults?

It's just a struct, so you can say my_func.jit_handlers() = JITHandlers{} to reset it. Not sure if it's worth adding a method.

steven-johnson · 2021-10-25T17:03:03Z

src/runtime/HalideRuntimeCuda.h

@@ -65,6 +65,23 @@ extern uintptr_t halide_cuda_get_device_ptr(void *user_context, struct halide_bu
 * driver. See halide_reuse_device_allocations. */
 extern int halide_cuda_release_unused_device_allocations(void *user_context);

+// These typedefs treat both a CUcontext and a CUstream as a void *,
+// to avoid dependencies on cuda headers.
+typedef int (*halide_cuda_acquire_context_t)(void *,   // user_context


(Completely orthogonal to this PR, but IMHO we should consider migrating typedef bar foo; to using foo = bar; as I think it reads easier and is easier to search for)

I believe this header is supposed to compile in C mode.

and/or with janky legacy toolchains

ahhhh right

(Do we actually compile/run any tests in plain-C mode? If not, we should add one)

…_context_3

These can come up if a JITUserContext is passed to something like copy_to_device before getting fully populated by passing it to a call to realize.

and reuse the runtime's name resolution mechanism instead

This change means we'll only ever create one built-in cuda context in this circumstance.

…_context_3

abadams · 2021-10-26T23:45:13Z

Turns out the gpu_object_lifetime tests were creating both a cuda runtime module and a cuda-debug runtime module, and were thus creating two cuda contexts. The latest changes dedup this and add some comments explaining things.

abadams added 2 commits October 22, 2021 11:52

Deprecate JIT runtime override methods that take void *

17ebf8b

Make it possible to use custom cuda contexts and streams in JIT mode

811f87d

steven-johnson reviewed Oct 25, 2021

View reviewed changes

abadams added 7 commits October 25, 2021 13:41

Clean up comments

d3df50f

Merge branch 'abadams/custom_cuda_context_2' into abadams/custom_cuda…

f6dd0dd

…_context_3

Tolerate null handlers in the JITUserContext

d3e17d2

These can come up if a JITUserContext is passed to something like copy_to_device before getting fully populated by passing it to a call to realize.

Remove reliance on dlsym in test

677dd4f

and reuse the runtime's name resolution mechanism instead

Handle case where cuda and cuda-debug runtime modules both exist

50adbde

This change means we'll only ever create one built-in cuda context in this circumstance.

Slight simplification

48b6912

Merge remote-tracking branch 'origin/master' into abadams/custom_cuda…

d2c64c4

…_context_3

Improve comments

d0cdc15

steven-johnson approved these changes Oct 28, 2021

View reviewed changes

abadams merged commit 1c7388a into master Oct 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow users to use their own cuda contexts and streams in JIT mode #6345

Allow users to use their own cuda contexts and streams in JIT mode #6345

abadams commented Oct 23, 2021

steven-johnson left a comment

steven-johnson Oct 25, 2021

abadams Oct 25, 2021

abadams Oct 26, 2021

steven-johnson Oct 25, 2021

abadams Oct 25, 2021

abadams Oct 25, 2021

steven-johnson Oct 25, 2021

steven-johnson Oct 28, 2021

abadams commented Oct 26, 2021

Allow users to use their own cuda contexts and streams in JIT mode #6345

Allow users to use their own cuda contexts and streams in JIT mode #6345

Conversation

abadams commented Oct 23, 2021

steven-johnson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

abadams commented Oct 26, 2021