Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use cuda events API to make profiler better on GPU #6419

Open
abadams opened this issue Nov 15, 2021 · 2 comments
Open

Use cuda events API to make profiler better on GPU #6419

abadams opened this issue Nov 15, 2021 · 2 comments
Labels
enhancement New user-visible features or improvements to existing features. gsoc Potential Google Summer of Code projects

Comments

@abadams
Copy link
Member

abadams commented Nov 15, 2021

Currently the sampling profiler doesn't measure anything really meaningful for GPU schedules. Using the CUDA events API we could do better and at least get an accurate per-kernel runtime like nvprof does.

@abadams abadams added enhancement New user-visible features or improvements to existing features. gsoc Potential Google Summer of Code projects labels Nov 15, 2021
@mcourteaux
Copy link
Contributor

I'm not 100% sure what you suggest here, but I'd like to point out that I managed to run nvprof and visualize the result with nvvp of GPU-enabled Halide applications. Very insightful!
image

I'm not aware of a sampling profiler for Halide. Are you talking about just the Linux perf tool?

@abadams
Copy link
Member Author

abadams commented Nov 18, 2021

Halide has a built-in sampling profiler that turns on when you add the -profile target flag. It's not particularly useful for the GPU though, which is what this issue is about. I also use nvvp currently for GPU schedules.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New user-visible features or improvements to existing features. gsoc Potential Google Summer of Code projects
Projects
None yet
Development

No branches or pull requests

2 participants