-
Notifications
You must be signed in to change notification settings - Fork 109
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The custom CUDA extensions may be uncompatible with PyTorch 1.10.0, possible missing dependency ninja
#9
Comments
Thanks for reminding me the dependency :) Yes, this is expected when the first time you run the code where I turn on the verbose mode of Pytorch jit compilation module for debugging purpose. |
Great to know - I'll use the version with the custom kernels then. Thanks! Edit: Maybe an other question @sunset1995 : I saw you increased what seem to be the batch size here https://github.com/sunset1995/DirectVoxGO/blob/main/run.py#L90 from 16 to 65536, is it safe to reduce it or is it actually leveraged by the rendering kernels? In the previous version of the library, even 16 was too high to run on my GPU which has low memory. I did not try yet to run inference with the new version but I will let you know how to goes memory-wise. (Edit again: well reducing to 32768 is fine this time for my memory. Weird.) Edit again: Just to let you know, but probably you are aware: the old trained models have missing keys with the new version, see e.g. the error during inference
New trained models are fine (and run faster which is cool!). |
Hi, Also, is there a way to disable the new optimization related changes in the new repo if that is causing issues? |
It seems that window OS have some issues with this. |
I solved this problem "RuntimeError("Ninja is required to load C++ extensions")" by installing
|
@Learningm did you resolve it on windows as well? |
@aarrushi I resolved it on Linux. |
Hello,
I run the training in a google colab instance.
My code is roughly the following (in different cells):
First error coming out (in the last line of the above), I need
ninja
:Then, I do
Rerunning the line
!python run.py --config configs/blendedmvs/Jade.py --render_test # train
, I now get a very ugly output, that I put on pastebin because it is too long: https://pastebin.com/Um0cMJfg . Notice that despite the warnings, the training starts at the end.Is this output expected? If these warnings/errors appear due to PyTorch 1.10.0 and are critical, I would suggest to disable those custom CUDA extensions by default so that training can run on several PyTorch versions, not only 1.8.1.
Thanks a lot!
The text was updated successfully, but these errors were encountered: