Model loading is too slow with onnxruntime-gpu #5957
Comments
sess = rt.InferenceSession(onnx_fn) --> this waits for 50-200 seconds on the first call. With the same model, the amount of time varies even across sequential tries. Sometimes I have to wait for 5 minutes, and that makes the debugging process not very enjoyable. Any help is appreciated.
The yolov5x model took only 2 seconds to load. I was always uninstalling onnxruntime before installing onnxruntime-gpu. If this is specified in the documents, I'm sorry guys :) If not, I have no idea whether this is an important issue to resolve or not. You decide, and feel free to close this issue. Keep up the good work.
My previous comment is wrong. Uninstalling and reinstalling has nothing to do with this issue. I tried on another computer with an RTX 3080 GPU and an Intel(R) Core(TM) i7-10700F CPU @ 2.90GHz, and the same issue occurred. I have no idea what made the other computer load models in 0-1 seconds.
I'm not sure whether this issue is related to onnxruntime or not. These are the steps that resolved the issue in my tries on two computers:
I encounter the same problem.
@korhun is the issue resolved for you now (based on your earlier comment)? We'll be upgrading to CUDA 11 in the next release (coming soon).
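One plausible reason the CUDA 11 upgrade matters here (hedged; the thread itself never confirms the root cause) is that a wheel built against an older CUDA has no prebuilt binaries for newer GPUs such as the RTX 3080, so every kernel is PTX JIT-compiled at session creation. As an interim workaround, NVIDIA's JIT compilation cache can be enlarged so the compiled kernels are reused across runs; `your_inference_script.py` below is a hypothetical script name:

```shell
# Enlarge NVIDIA's JIT compilation cache (the default is small) so kernels
# JIT-compiled on the first run are cached and reused on later runs.
# 2147483648 bytes = 2 GiB; CUDA_CACHE_MAXSIZE is a documented CUDA
# toolkit environment variable.
export CUDA_CACHE_MAXSIZE=2147483648
python your_inference_script.py   # the very first run is still slow
```

This only helps across process restarts; upgrading to a CUDA 11 build of onnxruntime-gpu removes the JIT step entirely for Ampere GPUs.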
@pranavsharma yes, the issue has been resolved on 2 different computers after trying the steps mentioned in my previous comment. Thanks for your interest. I'm looking forward to your next release; keep up the good work.
Thanks to the work of @radu-matei, @dkim, @dllu, I generated a binding for Linux that works with ONNX 1.7 and CUDA 11. This avoids a performance issue with CUDA: microsoft/onnxruntime#5957
Thank you for providing the solution. I would like to know how I should solve this if my computer has no Nvidia GPU (MacBook Pro)?
@SystemErrorWang there was no issue when working on CPU in any of my tries. I recommend you uninstall onnxruntime-gpu and install the latest version of onnxruntime.
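For a CPU-only machine, the switch suggested above is just a package swap; a minimal sketch with pip, assuming a standard pip environment:

```shell
# Remove the GPU build and install the CPU-only package.
# Only one of the two packages should be installed at a time.
pip uninstall -y onnxruntime-gpu
pip install onnxruntime
```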
Thank you korhun, I finally found the problem: I used onnx-simplifier and it caused this error; the raw onnx model works fine.
Using the CPU version, I have no problem. Loading onnx models via "InferenceSession" with onnxruntime-gpu takes >102 seconds for the first model. If we load more than one model, the others take no time at all; we only wait on the first call of "onnxruntime.InferenceSession(onnx_fn)". I have tried with yolov3, yolov3-tiny, yolov4, yolov5, and ssd_mobilenet.
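The first-load behavior described above can be reproduced with a small timing harness. This is a sketch, not code from the thread; "model.onnx" is a placeholder path, and the `timed`/`measure_load_times` helpers are hypothetical names:

```python
import time

def timed(label, fn):
    """Run fn(), print the elapsed wall-clock seconds, return (result, elapsed)."""
    start = time.perf_counter()
    result = fn()
    elapsed = time.perf_counter() - start
    print(f"{label}: {elapsed:.1f} s")
    return result, elapsed

def measure_load_times(onnx_path):
    # Deferred import so the timing helper above stays stdlib-only.
    import onnxruntime as rt
    # The first session pays any one-time GPU initialization cost;
    # later sessions in the same process reuse it and return quickly.
    timed("first load", lambda: rt.InferenceSession(onnx_path))
    timed("second load", lambda: rt.InferenceSession(onnx_path))
```

Calling `measure_load_times("model.onnx")` with onnxruntime-gpu installed should show the asymmetry reported here: a long first load, then a near-instant second one.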
System information
Is this normal? Should I try other versions of CUDA or cuDNN?