Installation failed with cmake error #355
Comments
Hi @RuiWang1998, could you share the command you used for installation and the full error message you are getting? Thank you!
Hi @ptrendx, we used both |
Hi @ptrendx, after a little digging, we think we have located the problem but are not sure what the solution is:
It seems we are missing some headers. How can we include them? We have machines with CUDA 11.8 and machines with CUDA 12, and we believe both fail for the same reason.
Hi, some updates: our H800 machines can now install successfully, but the A100 machines cannot yet. The H800 machines just needed cuDNN, but the A100 machines still hit the error above even after cuDNN was installed.
Hi, this is a pretty strange error - functions like |
Hi, yes it is in |
Any update on this issue? |
Hi @MicPie, we have been able to install this with newer commits now. Were you trying the stable releases?
I have the same problem on my workstation with an A6000 Ada.
@RuiWang1998 Could you help me figure out what I should do? Install cuDNN?
Hi,
You would have to modify setup.py so that it outputs the actual error message (or run the commands manually in a terminal), so that we can see exactly what is going on.
Best,
Rui
On Nov 21, 2023 at 5:05 PM +0800, mahdip72 wrote:
I have the same problem in my workstation with A6000 ada.
raise RuntimeError(f"Error when running CMake: {e}")
RuntimeError: Error when running CMake: Command '['/usr/bin/cmake', '-S', '/tmp/pip-req-build-hnl1xnl7/transformer_engine', '-B', '/tmp/tmp6vkf06mc', '-DCMAKE_BUILD_TYPE=Release', '-DCMAKE_INSTALL_PREFIX=/tmp/pip-req-build-hnl1xnl7/build/lib.linux-x86_64-cpython-311']' returned non-zero exit status 1.
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building wheel for transformer-engine
@RuiWang1998 Could you help me what should I do? Install CUDNN?
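One way to surface the underlying error, as suggested above, is to run the failing command yourself and print everything it wrote. The following is only a minimal sketch: `run_and_report` is a hypothetical helper and is not part of TransformerEngine's setup.py, and the failing child process is a stand-in for the real CMake invocation.

```python
import subprocess
import sys


def run_and_report(cmd):
    """Run a command and, if it fails, echo its captured stdout and stderr.

    Hypothetical helper for illustration; not taken from setup.py.
    """
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        print(f"Command failed with exit status {result.returncode}: {cmd}",
              file=sys.stderr)
        print("--- stdout ---", file=sys.stderr)
        print(result.stdout, file=sys.stderr)
        print("--- stderr ---", file=sys.stderr)
        print(result.stderr, file=sys.stderr)
    return result


if __name__ == "__main__":
    # Stand-in for the real CMake command: a child process that writes an
    # error message to stderr and exits with a non-zero status.
    failing = [
        sys.executable,
        "-c",
        "import sys; sys.stderr.write('missing header\\n'); sys.exit(1)",
    ]
    run_and_report(failing)
```

To use it on a real build, replace the stand-in list with the CMake command from your own traceback, pointing `-S` at a source directory that still exists (for example a local checkout).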
@RuiWang1998 Could you tell me which release version you used? I had the same problem. Thanks.
Same issue:
File "/aml2/TransformerEngine/setup.py", line 338, in _build_cmake
The CMake error message should already be printed to stderr, although it is somewhat buried within the Python stacktrace from If the error is happening during CMake configuration, it's probably because CUDA or cuDNN are not properly installed. See CUDA instructions at #700 (comment). For cuDNN, make sure |
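Since the CMake message tends to be buried in a long log, it can help to filter a saved build log for it. This is a rough sketch, assuming the log has been saved as text and that the relevant lines start with the `CMake Error` prefix that CMake normally prints; `extract_cmake_errors` and the sample log are hypothetical and only illustrate the idea.

```python
def extract_cmake_errors(log_text, context=3):
    """Return each line starting with 'CMake Error' plus a few following lines.

    Hypothetical helper; `context` is the number of lines kept after each match.
    """
    lines = log_text.splitlines()
    blocks = []
    for i, line in enumerate(lines):
        # pip may indent subprocess output, so ignore leading whitespace.
        if line.lstrip().startswith("CMake Error"):
            blocks.append("\n".join(lines[i:i + 1 + context]))
    return blocks


if __name__ == "__main__":
    # Made-up log excerpt for illustration only; not real build output.
    sample_log = "\n".join([
        "Running command Building wheel",
        "CMake Error at CMakeLists.txt:1 (example):",
        "  Example message describing what was not found.",
        "Traceback (most recent call last):",
        "RuntimeError: Error when running CMake",
    ])
    for block in extract_cmake_errors(sample_log, context=1):
        print(block)
```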
I solved this issue by simply running this command
under the TransformerEngine directory. I hope this helps you.
I also ran into this problem. The error details are: raise RuntimeError(f"Error when running CMake: {e}") My environment is below:
Hello, my friend! nvcc --version If error occurs, you may fix it by |
@wplf Yes, my nvcc seems OK. The output is below:
Can you check your cmake version? |
@wplf (yuxunlian) ubuntu@ip-172-31-38-93:~$ cmake --version Is this version appropriate? |
Yes, this is OK.
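The two checks in this exchange (`nvcc --version` and `cmake --version`) can be run together. Below is a small sketch, assuming both tools are expected on `PATH`; `tool_version` is a hypothetical helper, and it only reports what it finds without judging whether a version is new enough.

```python
import shutil
import subprocess


def tool_version(name, flag="--version"):
    """Return the version banner of a tool found on PATH, or None if missing."""
    path = shutil.which(name)
    if path is None:
        return None
    result = subprocess.run([path, flag], capture_output=True, text=True)
    # Some tools print their banner to stderr rather than stdout.
    return (result.stdout or result.stderr).strip()


if __name__ == "__main__":
    for tool in ("nvcc", "cmake"):
        banner = tool_version(tool)
        if banner is None:
            print(f"{tool}: NOT FOUND on PATH")
        else:
            first_line = banner.splitlines()[0] if banner else "(no output)"
            print(f"{tool}: {first_line}")
```

If `nvcc` is reported as missing even though CUDA is installed, the CUDA `bin` directory is probably not on `PATH` in the shell that runs the build.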
@wplf |
Any update on this issue? I'm still getting the same error. |
If you are experiencing an error that looks like
cd transformer_engine
pip install -v -v -v .
Some common build errors and fixes:
I'll lock this issue to make this comment easier for users to find, but please open a new issue if you are encountering a build error (with enough of the build log for us to help).
Hi,
We are testing our new Hopper machines (H800/H100) and trying to use FP8 for training for the first time, but are having trouble installing TransformerEngine. It reports RuntimeError: Error when running CMake: Command '['/usr/local/bin/cmake', '-S', '/tmp/pip-req-build-p6kjladj/transformer_engine', '-B', '/tmp/tmps08o01xi', '-DCMAKE_BUILD_TYPE=Release', '-DCMAKE_INSTALL_PREFIX=/tmp/pip-req-build-p6kjladj/build/lib.linux-x86_64-cpython-310', '-GNinja']' returned non-zero exit status 1.
We tried to invoke the command outside of pip, and it just reports that there is no source directory.
We are trying Docker right now, but our network configuration does not let us use Docker conveniently, so we would usually prefer not to use it. Could you show us where we could find any clues on how to proceed? Much appreciated.