Vulkan won't run on Ubuntu 20.04 with TITAN RTX and nvidia-driver-450.51.05, any ideas?

I'm trying to run a UE4-based vehicle simulator on a Linux machine with a TITAN RTX. The TITAN works smoothly for CUDA work and the documentation says it supports Vulkan, but Vulkan won't run, which stops the simulator from running. One possible complication is that I'm running nvidia-driver-450, which is an older version, but it's the only NVIDIA driver that has ever worked on this machine.

I'll add for context that my background is in embedded hardware, so I am a little in over my head with all this graphics stuff. I consider myself an intermediate Linux user, but I've never worked with Vulkan or anything like it before. Also, due to the pandemic I only have remote access to the machine, because it's locked in my lab on a university campus.

vulkaninfo fails with:

$ vulkaninfo
ERROR: [Loader Message] Code 0 : /usr/lib/i386-linux-gnu/libvulkan_radeon.so: wrong ELF class: ELFCLASS32
WARNING: [Loader Message] Code 0 : loader_icd_scan: Can not find 'ICD' object in ICD JSON file /usr/share/vulkan/icd.d/nvidia_layers.json.  Skipping ICD JSON
ERROR: [Loader Message] Code 0 : /usr/lib/i386-linux-gnu/libvulkan_intel.so: wrong ELF class: ELFCLASS32
ERROR at /build/vulkan-tools-1.2.162.1~rc1-1lunarg20.04/vulkaninfo/vulkaninfo.h:248:vkEnumerateInstanceExtensionProperties failed with ERROR_INITIALIZATION_FAILED

As near as I can tell, the line of consequence is the last one, with ERROR_INITIALIZATION_FAILED, but I haven't found many other people online with this error on an NVIDIA GPU. I've tried uninstalling and reinstalling all the drivers and all the Vulkan packages, with no luck. I also tried downgrading to Vulkan 1.1, because NVIDIA's release notes for driver 450 said that driver version supported Vulkan 1.1 specifically, also with no luck.
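
The loader warning above says that /usr/share/vulkan/icd.d/nvidia_layers.json has no 'ICD' object, so it may be worth checking which manifests in the ICD directories actually describe a driver. The following is a minimal sketch, not a confirmed fix. The function name check_icd_manifests is hypothetical, the two directories are assumed to be the loader's usual search paths on this machine, and the grep is only a rough text match rather than a real JSON parse.

```shell
# Hypothetical helper: list Vulkan manifest files in the given directories
# and flag any that do not contain an "ICD" key.
check_icd_manifests() {
    for dir in "$@"; do
        for f in "$dir"/*.json; do
            # Skip the unexpanded glob when a directory is empty or missing.
            [ -e "$f" ] || continue
            if grep -q '"ICD"' "$f"; then
                echo "ok: $f"
            else
                echo "no ICD object: $f"
            fi
        done
    done
}

# Assumed standard locations; adjust if the loader searches elsewhere.
check_icd_manifests /usr/share/vulkan/icd.d /etc/vulkan/icd.d
```

If no NVIDIA manifest shows up as "ok", the loader has no 64-bit NVIDIA driver to load, which would be consistent with the initialization failure.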

I'm really hoping this is solvable without changing the NVIDIA drivers, because I have already put many hours into trying to get a more recent version to work, with no luck. But if people think the nvidia-driver version is the problem, then I can post a separate question that deals just with that.

Does anyone have any suggestions for how to proceed or ideas on what might be the problem? I'm very lost and any ideas would be much appreciated.

Some outputs that might be helpful:

$ nvidia-smi
Sun Jan 31 16:59:01 2021       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.51.05    Driver Version: 450.51.05    CUDA Version: 11.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  TITAN RTX           Off  | 00000000:01:00.0 Off |                  N/A |
| 41%   26C    P8    19W / 280W |    116MiB / 24211MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      7254      G   /usr/lib/xorg/Xorg                114MiB |
+-----------------------------------------------------------------------------+

From lspci:

$ lspci -k
...
01:00.0 VGA compatible controller: NVIDIA Corporation TU102 [TITAN RTX] (rev a1)
    Subsystem: NVIDIA Corporation TU102 [TITAN RTX]
    Kernel driver in use: nvidia
    Kernel modules: nvidiafb, nouveau, nvidia_drm, nvidia
   ...

Also potentially of consequence:

$ glxinfo -B
name of display: :12
display: :12  screen: 0
direct rendering: Yes
Extended renderer info (GLX_MESA_query_renderer):
    Vendor: Mesa/X.org (0xffffffff)
    Device: llvmpipe (LLVM 11.0.0, 256 bits) (0xffffffff)
    Version: 20.2.6
    Accelerated: no
    Video memory: 32031MB
    Unified memory: no
    Preferred profile: core (0x1)
    Max core profile version: 4.5
    Max compat profile version: 3.1
    Max GLES1 profile version: 1.1
    Max GLES[23] profile version: 3.2
OpenGL vendor string: Mesa/X.org
OpenGL renderer string: llvmpipe (LLVM 11.0.0, 256 bits)
OpenGL core profile version string: 4.5 (Core Profile) Mesa 20.2.6
OpenGL core profile shading language version string: 4.50
OpenGL core profile context flags: (none)
OpenGL core profile profile mask: core profile

OpenGL version string: 3.1 Mesa 20.2.6
OpenGL shading language version string: 1.40
OpenGL context flags: (none)

OpenGL ES profile version string: OpenGL ES 3.2 Mesa 20.2.6
OpenGL ES profile shading language version string: OpenGL ES GLSL ES 3.20

Tags: linux, drivers, graphics-card, nvidia
asked on Super User Feb 1, 2021 by Sean Geary • edited Feb 1, 2021 by Giacomo1968
