We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cuda::(std::)memcpy
I would like an optimized cuda::memcpy function that takes a CG to do the copy and handles alignment and doing vectorized copies when possible.
cuda::memcpy
Much of the machinery is already available in cuda::memcpy_async, but it needs to be exposed as a public cuda::memcpy function.
cuda::memcpy_async
The text was updated successfully, but these errors were encountered:
std::memcpy
As mentioned, a lot of the machinery exists. We still need to do peeling, I think, and validate codegen.
Sorry, something went wrong.
griwes
No branches or pull requests
I would like an optimized
cuda::memcpy
function that takes a CG to do the copy and handles alignment and doing vectorized copies when possible.Much of the machinery is already available in
cuda::memcpy_async
, but it needs to be exposed as a publiccuda::memcpy
function.The text was updated successfully, but these errors were encountered: