GPU unavailable for openVINO when Intel Atom E3950 NUC is headless - headless

I have installed XRDP on Up2 board running Ubuntu 16.04 and my acceleration on the GPU works fine when I connect a monitor and run it locally. It also works fine when I run it remotely so long as a monitor is connected. Once the monitor is disconnected I get self.exec_net =
self.ie.load_network(network=self.net,device_name=self.plugin)
File "ie_api.pyx", line 178, in
openvino.inference_engine.ie_api.IECore.load_network
File "ie_api.pyx", line 187, in
openvino.inference_engine.ie_api.IECore.load_network
RuntimeError: Failed to create plugin
/opt/intel/openvino_2020.3.194/deployment_tools/inference_engine/lib/intel64/libclDNNPlugin.so
for device GPU
Please, check your environment
[CLDNN ERROR]. clGetPlatformIDs error -1001
Is the GPU being disabled when no monitor is detected? if so, how can I get around this issue?

To load GPU plugin, Intel® Graphics Compute Runtime for OpenCL™ driver components are required to be installed.
In your scenario, GPU plugin is working fine when connecting to a monitor. This issue only happens when the monitor is not connected. Usually, the operating system detects a connected display (monitor) and enables the GPU. The GPU hardware is often disabled when no connected display is detected.
We recommend you post your question in xrdp GitHub for further advice to enable GPU without connecting to a monitor.

Related

Error: couldn't find RGB GLX visual or fbconfig | Prime on demand Nvidia card

I have a Lenovo G580 computer with intel CPU and a Nvidia 610M GPU. Running Linux Lite OS (Ubuntu based).
I would like to use Nvidia prime to run programs with the GPU.
I installed some packages about Nvidia drivers, version 390 according to this page.
With the Nvidia X Server Settings I can switch to on demand mode. On th UI there is only one settings for prime, no mention about the GPU settings.
My problem is that when the on demand mode is enable, many programs (games and glx debug programs) throw this error : (even without asking to use GPU)
Error: couldn't find RGB GLX visual or fbconfig
I know there is other posts like mine on internet however I can't understand the problem or identify a missing package on my computer. Have you already install prime on this GPU ? I can send logs or system info if needed.

Radeon developer panel not detecting running program

I have a vulkan application I want to profile (to find the bottlenecks on the gpu for optimizations). I am on linux and amd hardware so I downloaded the linux version of the radeon developer tools. I ran it and created a local server and that seems to work.
I then launched my program, but it does not appear on the list of profiling candidates in the panel.
As you can see the connection is fine (green dot), but no applications are detected. I have tried with advanced mode as well but no luck.
I know for a fact the program is running as I can see it and use it, recompile it... Has anyone run into this problem before?

How can I update NVIDIA driver especially for OpenCL on Google Colab?

You can use GPU such as T4, P100 (as well as K80) on Google Colaboratory.
However, the default OpenCL driver does not support half precision features.
I want to use them.
Does the latest driver enable them?
And how can I update NVIDIA driver on Google Colaboratory?
The below is what I tried.
I downloaded the latest driver (XXX.run) from NVIDIA site and
!apt remove nvidia-*
# reboot by runtime menu
sh XXX.run
But I got
ERROR: An NVIDIA kernel module 'nvidia-uvm' appears to already be loaded in
your kernel. This may be because it is in use (for example, by an X
server, a CUDA program, or the NVIDIA Persistence Daemon), but this
may also happen if your kernel was configured without support for
module unloading. Please be sure to exit any programs that may be
using the GPU(s) before attempting to upgrade your driver. If no
GPU-based programs are running, you know that your kernel supports
module unloading, and you still receive this message, then an error
may have occured that has corrupted an NVIDIA kernel module's usage
count, for which the simplest remedy is to reboot your computer.
I also failed rmmod nvidia-uvm.
rmmod: ERROR: ../libkmod/libkmod.c:514 lookup_builtin_file() could not open builtin file '/lib/modules/4.14.137+/modules.builtin.bin'
rmmod: ERROR: ../libkmod/libkmod-module.c:793 kmod_module_remove_module() could not remove 'nvidia_uvm': Operation not permitted
rmmod: ERROR: could not remove module nvidia-uvm: Operation not permitted

Debugging (possibly) OpenCV related crash on Jetson TK1

What I am looking for: I need help debugging consistently happening system crashes on my Jetson TK1.
System: I am using a Jetson TK1 board from NVIDIA. Updated to 21.3.4 Grinch Kernel. All drivers installed, libopencv4tegra installed alongside ROS (using hacked deb packages to not overwrite openCV). Everything used to work perfectly in this exact setup.
When the crashes happen: I am running a VSLAM program, which uses a camera connected on the USB port. The program is making heavy use of OpenCV. The program used to run for over 1 month without problems in the current setup. Now, I am getting consistent system crashes which result in a total system freeze. When I am connected over ssh, I loose connection. When I connect a monitor to see what happens on the system while it crashes, I can see everything freeze. The USB port also seems to turn off, since not even USB mouse and keyboard work anymore post-crash. The Jetson stays on though.
Crash Logs: I have tried looking into the /var/log/ logs, but none of them show any messages for when the crash happens.
I have run memtester before. It didn't return any bad memory. While running and crashing, the memory onboard is used at about 60-75% (as shown by "top"). CPU usage is around 60%.
The weird thing is that this exact setup has been running just like this for over a month now.
I need to know: are there any other logs I could find information about the crash in? How could I find out if this is related to a hardware failure or whether there's a software issue?
Thanks
-Marc

Jetson TK1 booting issues

Received my Jetson TK1 yesterday. After unboxing it and configuring the Linux GUI, rebooting the device with a mouse (cordless) attached to its USB 3.0 port takes it to some sort of Command line page where it probably loads some files and then the screen starts printing " [ . ] ". Nothing happens beyond that until I restart the board without any USB peripheral and then the device boots into the normal Linux GUI. Unable to figure out what's wrong with my board and why is it not working properly.(I am a newbie to LINUX)
P.S.: Connecting the monitor via HDMI after switching on the device gives no visual output, just a blank screen. Is it possible to connect the device via network adapter for remote access even it the screen is running blank?
The question is quite old, but as some people might get frustrated with it, I'll provide the answer for most probable cause.
Upgrading the board running 19.X release causes libglx.so to be corrupted. The issue have been actively discussed on NVIDIA forums and the best way to solve it is to upgrade to 21.X.
Otherwise, you can try recovering the libglx.so in the usr/lib/xorg/modules/extensions/ from Tegra124_Linux_R19.3.0_armhf.tbz2.
Could you possibly provide a bit more information about your situation.
Are you able to go to command mode by pressing 'CTRL+F1' or 'CTRL+ALT+F1'?
If that works, it means your Jetson operating system is working but only the GUI is not working properly.
Yes, You can use ssh to your Jetson (what I do) if only the GUI of Jetson is broken and your OS is working properly. Note that in order to do so you need to know ip address of your Jetson and perform some possible router configuration.
Note: Sometimes if you have a USB device connected to your system (jetson), the jetson might mistakenly assume the USB is storage type and therefore tries to boot up from the USB. This leads to failure since it can not load any OS of the USB. (I'm not sure if this is the case for you)

Resources