You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have been using copperhead for quiet some time now and I don't see a way where users may communicate and get some support, so I will post my questions here so that anyone faces similar issues can refer to this (hope this is ok).
So, as the subject line says:
1- I am using copperhead blackscholes sample as a benchmark. I am trying to time both "memory transfers to and from GPU" + kernel launch and execution time. Source for the what I am doing it follows, am I doing this right according to below source?
2- the CPP source generated in the pycache after compiling the above, I need to compile it to a regular binary file (just like nvcc xxx.cu -o my_bin) and then run it using nvidia profiler to investigate some more according to timing obtained from "1" above. How exactly can I do that? this is a crucial thing in my benchmarking report so I need to do it or else I may conclude that it can't be done.
Thanks a lot in advance for all the help and time spent to educate me.
The text was updated successfully, but these errors were encountered:
Hi,
I have been using copperhead for quiet some time now and I don't see a way where users may communicate and get some support, so I will post my questions here so that anyone faces similar issues can refer to this (hope this is ok).
So, as the subject line says:
1- I am using copperhead blackscholes sample as a benchmark. I am trying to time both "memory transfers to and from GPU" + kernel launch and execution time. Source for the what I am doing it follows, am I doing this right according to below source?
SRC
2- the CPP source generated in the pycache after compiling the above, I need to compile it to a regular binary file (just like nvcc xxx.cu -o my_bin) and then run it using nvidia profiler to investigate some more according to timing obtained from "1" above. How exactly can I do that? this is a crucial thing in my benchmarking report so I need to do it or else I may conclude that it can't be done.
Thanks a lot in advance for all the help and time spent to educate me.
The text was updated successfully, but these errors were encountered: