Add robust dependency check for Python package #6436

ivanst0 · 2021-01-25T19:54:08Z

Description:

Use the correct version of CUDA when multiple versions are installed on the machine.
- A .py file is generated during CMake build and then used at runtime to check for the appropriate version of dependencies.
- cuDNN is expected to be installed at same location as CUDA Toolkit, as described in the official installation instructions from NVIDIA.
- Python 3.6 and 3.7 rely on PATH environment variable for locating CUDA/cuDNN DLLs, while in Python 3.8 and 3.9 a new API (os.add_dll_directory) is used to specify DLL search location.
- The new method of locating CUDA is also more robust: previously it could happen that everything works just fine if import torch is placed before import onnxruntime, but an error occurs if the order of these imports is switched (onnxruntime could end up using CUDA DLLs distributed with PyTorch).
Provide an informative error message to the user in case one of dependencies is missing.
- onnxruntime-gpu package depends on CUDA, cuDNN and C++ Runtime (if built with VS 2019).
- The missing dependency and the required version are printed as part of the error message.
- The check for VS 2019 C++ Runtime is now more robust.
Build onnxruntime-gpu Python package for Python 3.8/3.9 in CI.

Motivation and Context:

Having more than one version of CUDA installed on the machine is a quite common scenario. onnxruntime-gpu package should work out-of-the-box on such a machine, without the need for manual configuration.
The users are often confused when import onnxruntime complains about not being able to find a DLL - it's not obvious which dependency is missing.
Python 3.8 edition of onnxruntime-gpu package is still missing on PyPI. This change should help producing robust packages for all supported versions of Python.

Affected components:

onnxruntime and onnxruntime-gpu Python packages for Windows for all versions of Python.

fixes #5697
fixes #5963
fixes #6433
fixes #6435

snnn · 2021-01-25T20:27:47Z

/azp run orttraining-distributed

azure-pipelines · 2021-01-25T20:27:58Z

Azure Pipelines successfully started running 1 pipeline(s).

ivanst0 · 2021-01-27T18:59:15Z

@snnn, @jywu-msft
Could you please place cuDNN DLLs in the same directory as CUDA DLLs on machines in Win-GPU-2019 agent pool? This follows the official cuDNN installation instructions from NVIDIA and better represents the setup that end users are likely to have on their machines.

This should solve the only remaining test failure (in Windows GPU CI Pipeline).

snnn · 2021-01-27T23:01:28Z

@snnn, @jywu-msft
Could you please place cuDNN DLLs in the same directory as CUDA DLLs on machines in Win-GPU-2019 agent pool? This follows the official cuDNN installation instructions from NVIDIA and better represents the setup that end users are likely to have on their machines.

This should solve the only remaining test failure (in Windows GPU CI Pipeline).

I can do it. But now I'm busily working on Security Development Lifecycle (SDL) items. I won't have extra time to work on this in this week or the next week.

ivanst0 · 2021-02-04T15:23:43Z

@jywu-msft, could you please take a look at this PR?

jywu-msft · 2021-02-05T02:03:57Z

@jywu-msft, could you please take a look at this PR?

thanks. Will take a look. was OOF so still catching up on things.

onnxruntime/__init__.py

ivanst0 · 2021-02-05T16:26:43Z

Merged the latest master to pick up the recent fix for Windows build (caused by update to numpy 1.20).
Please kick off the builds again.

faxu · 2021-02-06T01:44:51Z

/azp run Linux CPU CI Pipeline, Linux CPU x64 NoContribops CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, MacOS NoContribops CI Pipeline, Windows CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline

faxu · 2021-02-06T01:45:04Z

/azp run Windows GPU CI Pipeline, WIndows GPU TensorRT CI Pipeline, centos7_cpu, centos7_cpu (linux_centos_ci Debug), centos7_cpu (linux_centos_ci Release), orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline

azure-pipelines · 2021-02-06T01:45:31Z

Azure Pipelines successfully started running 5 pipeline(s).

azure-pipelines · 2021-02-06T01:45:40Z

Azure Pipelines successfully started running 9 pipeline(s).

snnn · 2021-02-06T02:15:59Z

/azp run orttraining-distributed, Linux Nuphar CI Pipeline

azure-pipelines · 2021-02-06T02:16:14Z

Azure Pipelines successfully started running 2 pipeline(s).

jywu-msft

please test the code as I documented. it doesn't work.

ivanst0 · 2021-02-06T07:01:25Z

please test the code as I documented. it doesn't work.

"Windows GPU CI Pipeline" completed successfully. Do you believe something is missing in the pipeline?

snnn · 2021-02-06T07:08:25Z

please test the code as I documented. it doesn't work.

"Windows GPU CI Pipeline" completed successfully. Do you believe something is missing in the pipeline?

That pipeline is using python 3.7.

…o match

ivanst0 · 2021-02-07T22:11:36Z

/azp run Windows GPU CI Pipeline

azure-pipelines · 2021-02-07T22:11:40Z

Commenter does not have sufficient privileges for PR 6436 in repo microsoft/onnxruntime

jywu-msft · 2021-02-08T00:58:19Z

please test the code as I documented. it doesn't work.

"Windows GPU CI Pipeline" completed successfully. Do you believe something is missing in the pipeline?

yes, the fix must be tested with python 3.8

… pipeline' (Windows/GPU)

snnn · 2021-02-17T17:55:52Z

So, all the builds passed.

ivanst0 · 2021-02-17T18:48:03Z

please test the code as I documented. it doesn't work.

Python packaging pipeline succeeded for all supported operating systems and Python versions.

ivanst0 · 2021-02-18T09:17:33Z

@jywu-msft, @snnn,
Could you please advise what are the next steps for merging this PR?
All the pipelines completed successfully.
Further waiting only generates additional work (such as resolving a conflict in tools/ci_build/build.py which popped up recently).

outdated

jywu-msft · 2021-02-21T09:03:05Z

@jywu-msft, @snnn,
Could you please advise what are the next steps for merging this PR?
All the pipelines completed successfully.
Further waiting only generates additional work (such as resolving a conflict in tools/ci_build/build.py which popped up recently).

sorry for the delay. things have been busy as of late.
I kicked off python packaging pipeline. after that passes, will kick off the rest of the CI pipelines.

jywu-msft · 2021-02-21T13:02:09Z

/azp run orttraining-linux-ci-pipeline,orttraining-mac-ci-pipeline,orttraining-linux-gpu-ci-pipeline,centos7_cpu,Linux CPU Minimal Build E2E CI Pipeline,Linux Nuphar CI Pipeline,MacOS NoContribops CI Pipeline,Linux OpenVINO CI Pipeline,orttraining-distributed

azure-pipelines · 2021-02-21T13:02:53Z

Azure Pipelines successfully started running 9 pipeline(s).

jywu-msft · 2021-02-21T13:03:35Z

/azp run Linux CPU CI Pipeline,Linux CPU x64 NoContribops CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,MacOS CI Pipeline,MacOS NoContribops CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline

azure-pipelines · 2021-02-21T13:04:17Z

Azure Pipelines successfully started running 9 pipeline(s).

jywu-msft · 2021-02-21T13:05:16Z

/azp run orttraining-amd-gpu-ci-pipeline

azure-pipelines · 2021-02-21T13:05:25Z

Azure Pipelines successfully started running 1 pipeline(s).

Add robust dependency check for Python package

64f9d6c

ivanst0 requested a review from a team as a code owner January 25, 2021 19:54

snnn assigned jywu-msft Jan 25, 2021

ivanst0 added 5 commits January 25, 2021 21:59

Add version_info.py to .gitignore

c561fd9

Fix Linux build

12c9c94

Fix Windows CPU build

78300c3

Fix Windows 32-bit build

0a24716

Minor tweak

91d7d9a

jywu-msft reviewed Feb 5, 2021

View reviewed changes

onnxruntime/__init__.py Show resolved Hide resolved

Merge the latest master to fix Windows build

dbc1949

jywu-msft previously requested changes Feb 6, 2021

View reviewed changes

ivanst0 added 3 commits February 6, 2021 08:57

Generate version_info.py earlier in onnxruntime_python.cmake

65043d1

Print a user-friendly message if cuDNN is not found in

7d6b635

Relax version requirements for CUDA 11 - only the major version has t…

1d6a5fc

…o match

ivanst0 added 4 commits February 8, 2021 10:13

Merge the latest master to pick up CUDA 11 support in build pipelines

7950a66

Fix PATH environment variable to include CUDA 11 in 'Python packaging…

ad1a589

… pipeline' (Windows/GPU)

Fix the build with cuDNN 7

844d84b

Merge the latest master to pick up fixes in Python packaging pipeline

5bbf267

Merge the latest master and resolve the conflict in build.py

100a799

ivanst0 requested review from snnn and jywu-msft February 20, 2021 12:03

jywu-msft approved these changes Feb 21, 2021

View reviewed changes

jywu-msft merged commit c91f314 into microsoft:master Feb 21, 2021

skottmckay mentioned this pull request Jun 7, 2021

v1.8.0 raises Exception if cudnn not found in Program Files #7965

Closed

ivanst0 mentioned this pull request Jun 23, 2021

Fix Python Cuda loading issues #7939

Merged

guoyu-wang mentioned this pull request Mar 17, 2022

ORT python API raises ImportError if windows is not installed at C:\Windows or vcruntime140_1.dll is not installed under C:\Windows\System32 #10924

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add robust dependency check for Python package #6436

Add robust dependency check for Python package #6436

ivanst0 commented Jan 25, 2021 •

edited

Loading

snnn commented Jan 25, 2021

azure-pipelines bot commented Jan 25, 2021

ivanst0 commented Jan 27, 2021

snnn commented Jan 27, 2021

ivanst0 commented Feb 4, 2021

jywu-msft commented Feb 5, 2021

ivanst0 commented Feb 5, 2021

faxu commented Feb 6, 2021

faxu commented Feb 6, 2021

azure-pipelines bot commented Feb 6, 2021

azure-pipelines bot commented Feb 6, 2021

snnn commented Feb 6, 2021

azure-pipelines bot commented Feb 6, 2021

jywu-msft left a comment

ivanst0 commented Feb 6, 2021

snnn commented Feb 6, 2021

ivanst0 commented Feb 7, 2021

azure-pipelines bot commented Feb 7, 2021

jywu-msft commented Feb 8, 2021

snnn commented Feb 17, 2021

ivanst0 commented Feb 17, 2021

ivanst0 commented Feb 18, 2021

jywu-msft commented Feb 21, 2021

jywu-msft commented Feb 21, 2021

azure-pipelines bot commented Feb 21, 2021

jywu-msft commented Feb 21, 2021

azure-pipelines bot commented Feb 21, 2021

jywu-msft commented Feb 21, 2021

azure-pipelines bot commented Feb 21, 2021

Add robust dependency check for Python package #6436

Add robust dependency check for Python package #6436

Conversation

ivanst0 commented Jan 25, 2021 • edited Loading

snnn commented Jan 25, 2021

azure-pipelines bot commented Jan 25, 2021

ivanst0 commented Jan 27, 2021

snnn commented Jan 27, 2021

ivanst0 commented Feb 4, 2021

jywu-msft commented Feb 5, 2021

ivanst0 commented Feb 5, 2021

faxu commented Feb 6, 2021

faxu commented Feb 6, 2021

azure-pipelines bot commented Feb 6, 2021

azure-pipelines bot commented Feb 6, 2021

snnn commented Feb 6, 2021

azure-pipelines bot commented Feb 6, 2021

jywu-msft left a comment

Choose a reason for hiding this comment

ivanst0 commented Feb 6, 2021

snnn commented Feb 6, 2021

ivanst0 commented Feb 7, 2021

azure-pipelines bot commented Feb 7, 2021

jywu-msft commented Feb 8, 2021

snnn commented Feb 17, 2021

ivanst0 commented Feb 17, 2021

ivanst0 commented Feb 18, 2021

jywu-msft commented Feb 21, 2021

jywu-msft commented Feb 21, 2021

azure-pipelines bot commented Feb 21, 2021

jywu-msft commented Feb 21, 2021

azure-pipelines bot commented Feb 21, 2021

jywu-msft commented Feb 21, 2021

azure-pipelines bot commented Feb 21, 2021

ivanst0 commented Jan 25, 2021 •

edited

Loading