
Changes to run on CUDA with TchBackend #8

Open
jbrough opened this issue Aug 16, 2023 · 4 comments

Comments


jbrough commented Aug 16, 2023

This library is awesome, thank you. Incredibly fast and a much nicer API than alternatives.

I was hoping it would be the magic bullet that works on M2 and CUDA so that it can be deployed (running services from a MacBook seems the only option with these models!).

I tried last night on AWS with TchBackend and ran into:

Could not run 'aten::empty_strided' with arguments from the 'CUDA' backend.

After that I noticed your chunk branch used the same settings I'd used.

It looks like empty_strided isn't available on CUDA at all, and models using it need to be moved to the CPU.

Is it possible to use alternative methods in the tensor constructors so that the code is compatible with both WGPU and CUDA? Or do you have any pointers? Did you get it working with Tch initially?

@antimora

Most likely it's due to an incorrect CUDA version being specified. Please see this ticket for a similar problem and resolution: Gadersd/stable-diffusion-burn#2
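For anyone hitting the same thing, a rough sketch of how to compare versions (the expected version string here is a placeholder; substitute whatever CUDA build of libtorch you actually downloaded):

```shell
# Helper: pull the "release X.Y" number out of `nvcc --version` output.
cuda_release() {
  sed -n 's/.*release \([0-9.]*\).*/\1/p'
}

# Placeholder: the CUDA version your libtorch build targets.
expected="11.8"
installed="$(nvcc --version 2>/dev/null | cuda_release)"

if [ "$installed" = "$expected" ]; then
  echo "CUDA toolkit matches libtorch build ($expected)"
else
  echo "Mismatch: libtorch expects CUDA $expected, nvcc reports '${installed:-none}'"
fi
```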


jbrough commented Aug 16, 2023

This worked, thank you.

For anyone else:

I also needed libtorch installed and the LIBTORCH environment variable set, which is alluded to in that ticket, but there are more details in the tch-rs dependency's README:

https://github.com/LaurentMazare/tch-rs
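Concretely, the setup looked something like this (the install path is a placeholder; point it at wherever you unpacked the CUDA build of libtorch):

```shell
# Placeholder path: wherever the libtorch zip was extracted.
export LIBTORCH=/opt/libtorch
# Make libtorch's shared libraries findable at runtime.
export LD_LIBRARY_PATH="$LIBTORCH/lib${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
```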


jbrough commented Aug 19, 2023

@antimora I get a "Segmentation fault" as soon as it attempts to load a model when using the wgpu backend on an AWS G4ad instance (AMD Radeon Pro V520 GPU, which supports the Vulkan and OpenGL APIs).

I've tried it with AutoGraphicsApi, and with Vulkan and OpenGL specifically.

Is there anything I should try, or do you think this is a fatal issue at the moment?

My objective is to get whisper-burn deployable as a service (without shipping a Mac). I'm there with Conda on the Tch backend, but it would be good to be able to use AutoGraphicsApi and PR things back into this repo.
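One cheap thing to rule out before digging into burn itself is whether the instance's driver stack exposes those APIs at all. A sketch (the tool names assume the vulkan-tools and mesa-utils packages are installed):

```shell
# Probe for the diagnostic tool of each graphics API; a missing tool suggests
# the driver stack, rather than burn, is the first thing to fix.
check_api() {
  api="$1"; tool="$2"
  if command -v "$tool" >/dev/null 2>&1; then
    echo "$api: $tool present, run it to inspect the driver"
  else
    echo "$api: $tool not installed"
  fi
}

check_api Vulkan vulkaninfo
check_api OpenGL glxinfo
```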

@antimora

@jbrough I'm not sure, but it's worth looking into. Do you mind refiling this with the burn repo so it gets the team's attention? I don't think anyone else from the burn team is monitoring this repo. You can just copy your last comment. It'd also be super helpful if you could share the full error output you're seeing.
