[BUG] Segmentation fault on import cudf without CUDA #11941
Comments
Yikes! Investigating.
Thanks @shwina! If it helps, I think the bug was introduced somewhere between 22.04 and 22.06? I tried
>>> import cudf
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/cudf/__init__.py", line 5, in <module>
validate_setup()
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/cudf/utils/gpu_utils.py", line 20, in validate_setup
from rmm._cuda.gpu import (
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/rmm/__init__.py", line 16, in <module>
from rmm import mr
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/rmm/mr.py", line 14, in <module>
from rmm._lib.memory_resource import (
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/rmm/_lib/__init__.py", line 15, in <module>
from .device_buffer import DeviceBuffer
File "device_buffer.pyx", line 1, in init rmm._lib.device_buffer
TypeError: C function cuda.ccudart.cudaStreamSynchronize has wrong signature (expected __pyx_t_4cuda_7ccudart_cudaError_t (__pyx_t_4cuda_7ccudart_cudaStream_t), got cudaError_t (cudaStream_t))
whereas for older versions (
>>> import cudf
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/cudf/__init__.py", line 5, in <module>
validate_setup()
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/cudf/utils/gpu_utils.py", line 20, in validate_setup
from rmm._cuda.gpu import (
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/rmm/__init__.py", line 16, in <module>
from rmm import mr
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/rmm/mr.py", line 14, in <module>
from rmm._lib.memory_resource import (
File "/srv/conda/envs/cudfenv/lib/python3.8/site-packages/rmm/_lib/__init__.py", line 15, in <module>
from .device_buffer import DeviceBuffer
ImportError: libcuda.so.1: cannot open shared object file: No such file or directory
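The second traceback fails because the dynamic loader cannot open the CUDA driver library `libcuda.so.1`. A minimal sketch of checking for that library up front, using only the standard library, might look like the following. The helper name `cuda_driver_loadable` is hypothetical, and this is not how cudf or rmm perform their own setup validation.

```python
import ctypes


def cuda_driver_loadable(libname: str = "libcuda.so.1") -> bool:
    """Return True if the dynamic loader can open `libname`.

    Hypothetical helper: it only checks that the shared library loads,
    not that a usable GPU is actually present.
    """
    try:
        ctypes.CDLL(libname)
    except OSError:
        # Raised when the shared object cannot be found or opened.
        return False
    return True
```

On a machine without the NVIDIA driver installed, `cuda_driver_loadable()` would be expected to return `False`, which a caller could use to skip GPU-only imports.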
Thanks, @weiji14 -- could I ask you to check if force installing
mamba install --no-deps --clobber -c conda-forge cuda-python=11.7.1
Oh hang on, that will probably break you in other ways. With
TypeError: C function cuda.ccudart.cudaStreamSynchronize has wrong signature (expected __pyx_t_4cuda_7ccudart_cudaError_t (__pyx_t_4cuda_7ccudart_cudaStream_t), got cudaError_t (cudaStream_t))
I'll report back here when I have a proper solution...
Got this:
So yeah, probably need to do this properly 🙂
This should resolve a segfault we are seeing with `cuda-python=11.7.0` (rapidsai/cudf#11941). Authors: - Ashwin Srinath (https://github.com/shwina) - AJ Schmidt (https://github.com/ajschmidt8) - Bradley Dice (https://github.com/bdice) Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - Bradley Dice (https://github.com/bdice) - Mark Harris (https://github.com/harrism) - AJ Schmidt (https://github.com/ajschmidt8)
This should resolve a segfault we are seeing with `cuda-python=11.7.0` (#11941). Authors: - Ashwin Srinath (https://github.com/shwina) - Bradley Dice (https://github.com/bdice) - GALI PREM SAGAR (https://github.com/galipremsagar) - Jordan Jacobelli (https://github.com/Ethyling) Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - Bradley Dice (https://github.com/bdice)
This should resolve a segfault we are seeing with `cuda-python=11.7.0` (rapidsai/cudf#11941). Authors: - Ashwin Srinath (https://github.com/shwina) - Bradley Dice (https://github.com/bdice) - Ray Douglass (https://github.com/raydouglass) Approvers: - Bradley Dice (https://github.com/bdice) - Brad Rees (https://github.com/BradReesWork) - Ray Douglass (https://github.com/raydouglass) - GALI PREM SAGAR (https://github.com/galipremsagar)
This should resolve a segfault we are seeing with `cuda-python=11.7.0` (rapidsai/cudf#11941). Authors: - Ashwin Srinath (https://github.com/shwina) - Bradley Dice (https://github.com/bdice) Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - Bradley Dice (https://github.com/bdice) - Dante Gama Dessavre (https://github.com/dantegd) - Ray Douglass (https://github.com/raydouglass)
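The linked fixes all name `cuda-python=11.7.0` as the version associated with the segfault. A rough sketch of detecting that version in an environment, assuming the package is registered under the distribution name `cuda-python` (an assumption that may not hold for every install method), could be:

```python
from importlib import metadata
from typing import Optional

# Version named in the linked fixes as associated with the segfault.
AFFECTED_VERSION = "11.7.0"


def installed_version(dist_name: str = "cuda-python") -> Optional[str]:
    """Return the installed version of `dist_name`, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def is_affected(version: Optional[str]) -> bool:
    """True only for the exact version reported in this thread."""
    return version == AFFECTED_VERSION
```

Both function names are hypothetical. The check is deliberately an exact match, since the thread only reports this one version.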
@weiji14 We've released
Let us know if you still have any issues.
Closing this issue as the segfaults are fixed. Please feel free to re-open if this re-surfaces.
Describe the bug
Related to #11366. On a machine without NVIDIA GPUs, `import cudf` results in a RuntimeError followed by a segmentation fault. Context is that I have some code intended to be cross-compatible between CPU and GPU which used to work with `cudf=21.10`, but recent versions (e.g. `cudf=22.10`) result in an unrecoverable segfault.
Steps/Code to reproduce bug
Follow this guide http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports to craft a minimal bug report. This helps us reproduce the issue you're having and resolve the issue more quickly.
On a computer without an NVIDIA GPU, run these installation steps:
Then in a Python console, run:
results in
Expected behavior
Ideally, `import cudf` would just result in an `ImportError` (or some other error) without a segmentation fault.
Environment overview (please complete the following information)
Environment details
Please run and paste the output of the `cudf/print_env.sh` script here, to gather any other relevant environment details.
Additional context
Xref weiji14/deepicedrain@21e0e99
#11941 (comment)
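Because a segmentation fault cannot be caught with `try`/`except`, code that needs to stay cross-compatible between CPU and GPU could probe the import in a child interpreter first. The following is a minimal sketch under that assumption. The function name `import_survives` and the `"pandas"` fallback are illustrative, not something proposed in this thread.

```python
import subprocess
import sys


def import_survives(module_name: str, timeout: float = 60.0) -> bool:
    """Try `import module_name` in a child interpreter.

    A crash (such as a segfault) or an ImportError only affects the
    child process, so the caller can fall back to a CPU code path.
    """
    try:
        result = subprocess.run(
            [sys.executable, "-c", f"import {module_name}"],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        return False
    # A non-zero return code covers both Python exceptions and signals.
    return result.returncode == 0


# Hypothetical usage: choose a backend name without risking a crash here.
backend = "cudf" if import_survives("cudf") else "pandas"
```

The probe costs one extra interpreter start-up, which is the trade-off for isolating the parent process from an unrecoverable crash.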