fatal error: driver_types.h: No such file or directory #93

Closed
bpinaya opened this issue Dec 18, 2018 · 6 comments

Comments


bpinaya commented Dec 18, 2018

Hi there, this issue is related to #81, so I'll also tag @goldgeisser.
After fixing the symlink I was still getting that fatal error: driver_types.h: No such file or directory. The complete log is:

[  2%] Running gen_proto.py on onnx/onnx.in.proto
Processing /workspace/onnx-tensorrt/third_party/onnx/onnx/onnx.in.proto
Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_onnx2trt_onnx.proto
Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_onnx2trt_onnx.proto3
Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx.pb.h
generating /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_pb.py
[  4%] Running C++ protocol buffer compiler on /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_onnx2trt_onnx.proto
[  4%] Built target gen_onnx_proto
[  6%] Running gen_proto.py on onnx/onnx-operators.in.proto
Processing /workspace/onnx-tensorrt/third_party/onnx/onnx/onnx-operators.in.proto
Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators_onnx2trt_onnx.proto
Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators_onnx2trt_onnx.proto3
Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators.pb.h
generating /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_operators_pb.py
[  8%] Running C++ protocol buffer compiler on /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators_onnx2trt_onnx.proto
Scanning dependencies of target onnx_proto
[ 11%] Building CXX object third_party/onnx/CMakeFiles/onnx_proto.dir/onnx/onnx_onnx2trt_onnx.pb.cc.o
/workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_onnx2trt_onnx.pb.cc:598:13: warning: 'dynamic_init_dummy_onnx_2fonnx_5fonnx2trt_5fonnx_2eproto' defined but not used [-Wunused-variable]
 static bool dynamic_init_dummy_onnx_2fonnx_5fonnx2trt_5fonnx_2eproto = []() { AddDescriptors_onnx_2fonnx_5fonnx2trt_5fonnx_2eproto(); return true; }();
             ^
[ 13%] Building CXX object third_party/onnx/CMakeFiles/onnx_proto.dir/onnx/onnx-operators_onnx2trt_onnx.pb.cc.o
/workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators_onnx2trt_onnx.pb.cc:204:13: warning: 'dynamic_init_dummy_onnx_2fonnx_2doperators_5fonnx2trt_5fonnx_2eproto' defined but not used [-Wunused-variable]
 static bool dynamic_init_dummy_onnx_2fonnx_2doperators_5fonnx2trt_5fonnx_2eproto = []() { AddDescriptors_onnx_2fonnx_2doperators_5fonnx2trt_5fonnx_2eproto(); return true; }();
             ^
[ 15%] Linking CXX static library libonnx_proto.a
[ 20%] Built target onnx_proto
[ 22%] Building CUDA object CMakeFiles/nvonnxparser_plugin.dir/FancyActivation.cu.o
[ 24%] Building CUDA object CMakeFiles/nvonnxparser_plugin.dir/ResizeNearest.cu.o
[ 26%] Building CUDA object CMakeFiles/nvonnxparser_plugin.dir/Split.cu.o
[ 28%] Building CXX object CMakeFiles/nvonnxparser_plugin.dir/InstanceNormalization.cpp.o
In file included from /workspace/onnx-tensorrt/InstanceNormalization.hpp:27:0,
                 from /workspace/onnx-tensorrt/InstanceNormalization.cpp:23:
/usr/include/cudnn.h:63:26: fatal error: driver_types.h: No such file or directory
compilation terminated.
CMakeFiles/nvonnxparser_plugin.dir/build.make:101: recipe for target 'CMakeFiles/nvonnxparser_plugin.dir/InstanceNormalization.cpp.o' failed
make[2]: *** [CMakeFiles/nvonnxparser_plugin.dir/InstanceNormalization.cpp.o] Error 1
CMakeFiles/Makefile2:185: recipe for target 'CMakeFiles/nvonnxparser_plugin.dir/all' failed
make[1]: *** [CMakeFiles/nvonnxparser_plugin.dir/all] Error 2
Makefile:151: recipe for target 'all' failed
make: *** [all] Error 2
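
The failing include chain is cudnn.h pulling in a CUDA runtime header, so any translation unit that includes cudnn.h needs the CUDA include directory on its compiler search path. A minimal illustration of just that include problem (a sketch showing only the missing -I flag, not a complete compile line for this file):

# /usr/include/cudnn.h line 63 does roughly: #include <driver_types.h>
# so compiling anything that includes cudnn.h needs the CUDA headers on the path:
g++ -I/usr/local/cuda-10.0/include -c InstanceNormalization.cpp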

To rule out installation problems I might have committed, I decided to reproduce it in a container.
I'm pulling this container (from nvidia-docker at https://ngc.nvidia.com):

docker pull nvcr.io/nvidia/tensorrt:18.11-py3

And running with:

nvidia-docker run -it --rm nvcr.io/nvidia/tensorrt:18.11-py3

So the versions are the following:

  • cmake cmake --version:
cmake version 3.12.1

CMake suite maintained and supported by Kitware (kitware.com/cmake).
  • gcc gcc --version:
gcc (Ubuntu 5.4.0-6ubuntu1~16.04.10) 5.4.0 20160609
Copyright (C) 2015 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
  • nvidia-smi:
Tue Dec 18 14:39:10 2018       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 410.48                 Driver Version: 410.48                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 108...  Off  | 00000000:03:00.0 Off |                  N/A |
| 28%   25C    P8     8W / 250W |      2MiB / 11178MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  GeForce GTX 108...  Off  | 00000000:04:00.0  On |                  N/A |
| 23%   40C    P8    12W / 250W |   1099MiB / 11175MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
+-----------------------------------------------------------------------------+
  • nvcc nvcc --version:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2018 NVIDIA Corporation
Built on Sat_Aug_25_21:08:01_CDT_2018
Cuda compilation tools, release 10.0, V10.0.130
  • tensorrt dpkg -l | grep -i tensorrt:
ii  libnvinfer-dev                5.0.2-1+cuda10.0                      amd64        TensorRT development libraries and headers
ii  libnvinfer-samples            5.0.2-1+cuda10.0                      all          TensorRT samples and documentation
ii  libnvinfer5                   5.0.2-1+cuda10.0                      amd64        TensorRT runtime libraries
ii  python3-libnvinfer            5.0.2-1+cuda10.0                      amd64        Python 3 bindings for TensorRT
ii  python3-libnvinfer-dev        5.0.2-1+cuda10.0                      amd64        Python 3 development package for TensorRT
ii  tensorrt                      5.0.2.6-1+cuda10.0                    amd64        Meta package of TensorRT
  • protobuf (latest version installed from repo)

Also, the output of locate driver_types.h is empty, even though the symlink for cuda seems to be there. The output of ll /usr/local/ is:

total 64
drwxr-xr-x 1 root root 4096 Nov  3 01:57 ./
drwxr-xr-x 1 1000 1000 4096 Nov 27  2017 ../
drwxr-xr-x 1 root root 4096 Dec 18 14:24 bin/
lrwxrwxrwx 1 root root    9 Nov  3 01:33 cuda -> cuda-10.0/
drwxr-xr-x 1 root root 4096 Nov  3 01:45 cuda-10.0/
drwxr-xr-x 3 root root 4096 Nov  3 01:57 doc/
drwxr-xr-x 2 root root 4096 Oct  5 18:03 etc/
drwxr-xr-x 2 root root 4096 Oct  5 18:03 games/
drwxr-xr-x 1 root root 4096 Dec 18 14:24 include/
drwxr-xr-x 1 root root 4096 Dec 18 14:24 lib/
lrwxrwxrwx 1 root root    9 Oct  5 18:03 man -> share/man/
drwxr-xr-x 7 root root 4096 Nov  3 01:42 mpi/
drwxr-xr-x 2 root root 4096 Oct  5 18:03 sbin/
drwxr-xr-x 1 root root 4096 Nov  3 01:57 share/
drwxr-xr-x 2 root root 4096 Oct  5 18:03 src/
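
A quick way to confirm the header really is under the toolkit (locate's database can be stale or empty inside a fresh container) is something like the following; the paths assume the stock CUDA 10.0 layout:

# Check directly whether the CUDA runtime header exists under the toolkit:
ls -l /usr/local/cuda-10.0/include/driver_types.h
# Or search the whole toolkit tree for it:
find /usr/local/cuda-10.0 -name driver_types.h
# locate only reflects its database; refresh it before trusting an empty result:
updatedb && locate driver_types.h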

Only after passing the CUDA include dir variable to cmake was I able to solve it:

cmake -DCUDA_INCLUDE_DIRS=/usr/local/cuda-10.0/include -DTENSORRT_ROOT=/opt/tensorrt ..
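
For completeness, the full sequence that built for me inside the container looks roughly like this (a sketch; /opt/tensorrt is where this NGC image ships TensorRT, adjust if yours differs):

# From the onnx-tensorrt checkout, configure with explicit CUDA headers and build:
mkdir -p build && cd build
cmake -DCUDA_INCLUDE_DIRS=/usr/local/cuda-10.0/include -DTENSORRT_ROOT=/opt/tensorrt ..
make -j"$(nproc)"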

What I found weird is that even though the symlink seems to point to the correct location, I couldn't get it to build without passing that variable. Maybe a note in the README would suffice.

Or could it be some cmake shenanigans? I'll close the issue after I get some feedback, since it's easily fixable; I just wanted it to be here so that anyone who encounters something similar can have some insight.

Also, maybe some CI would be nice. I volunteer to set it up in either Travis, Circle, or maybe even Jenkins if NVIDIA (@benbarsdell) can provide a GPU-enabled container, maybe an image from NGC? @yinghai I could also set up code formatting and some linting so it'll be easier to contribute. I'm planning on submitting some PRs I did for some layers.


yinghai commented Dec 18, 2018

Yes, I really think we should have some CI for this repo. @benbarsdell, could you check with your folks to see if we can have some GPU machine for CI? Thanks.

@goldgeisser (Contributor)

@bpinaya An interesting issue. In general, CMake with native CUDA support should be able to find the CUDA install, and it worked for me and many others. However, you are not alone in experiencing this issue.

Reading about it more, I found the following paragraph in the 3.12 docs:
https://cmake.org/cmake/help/v3.12/module/FindCUDA.html

"...
It might be necessary to set CUDA_TOOLKIT_ROOT_DIR manually on certain platforms, or to use a CUDA runtime not installed in the default location. In newer versions of the toolkit the CUDA library is included with the graphics driver – be sure that the driver version matches what is needed by the CUDA runtime version.
..."

So it looks like CMake's ability to find the install depends on the platform and the driver.
I think it would be worth noting this possible issue in the README file - I'll do it.
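
For reference, the doc's suggestion translates to a configure line like this (a sketch; the toolkit path is an assumption, adjust to your install):

# Point CMake's FindCUDA at the toolkit explicitly when auto-detection fails:
cmake -DCUDA_TOOLKIT_ROOT_DIR=/usr/local/cuda-10.0 ..
# FindCUDA then typically derives CUDA_INCLUDE_DIRS from that root.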

Thank you for pointing this out.

@bigrobinson

I can confirm that

-DCUDA_INCLUDE_DIRS=/usr/local/cuda-10.0/include

is necessary on Jetson Xavier. Thanks @bpinaya.

@watershade

I met the same problem on the Jetson Nano platform.
I used this command:
cmake .. -DCUDA_INCLUDE_DIRS=/usr/local/cuda/include -DTENSORRT_ROOT=/usr/src/tensorrt -DGPU_ARCHS="53"

Before that, you need to install protobuf first.

It works fine for me.
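
(For the protobuf step, something like the following usually suffices on the Ubuntu-based Jetson images; the exact package names are an assumption:)

# Install the protobuf compiler and development headers before configuring:
sudo apt-get update
sudo apt-get install -y libprotobuf-dev protobuf-compiler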

HEBOS added a commit to HEBOS/onnx-tensorrt that referenced this issue Sep 5, 2019
To prevent issue onnx#93, we need to provide CUDA include directory.

mjsML commented Jan 6, 2020

I met the same problem building onnx-tensorrt for MXNet.

@ArtificialNotImbecile

-DCUDA_INCLUDE_DIRS=/usr/local/cuda-10.0/include

This also works for CUDA 10.1 and a Tesla T4 GPU.
