ORTE_ERROR_LOG: Data unpack would read past end of buffer in file grpcomm_direct.c - openmpi

Open MPI Version: v4.0.0
Output of ompi_info | head on two machine
mpiuser#s2:~$ ssh s1 ompi_info | head
Package: Open MPI mpiuser#s1 Distribution
Open MPI: 4.0.0
Open MPI repo revision: v4.0.0
Open MPI release date: Nov 12, 2018
Open RTE: 4.0.0
Open RTE repo revision: v4.0.0
Open RTE release date: Nov 12, 2018
OPAL: 4.0.0
OPAL repo revision: v4.0.0
OPAL release date: Nov 12, 2018
mpiuser#s2:~$ ompi_info | head
Package: Open MPI mpiuser#s2 Distribution
Open MPI: 4.0.0
Open MPI repo revision: v4.0.0
Open MPI release date: Nov 12, 2018
Open RTE: 4.0.0
Open RTE repo revision: v4.0.0
Open RTE release date: Nov 12, 2018
OPAL: 4.0.0
OPAL repo revision: v4.0.0
OPAL release date: Nov 12, 2018
Both are installed using common shared network.
while running command on s1(master)
mpiuser#s1:/disk3/cloud/openmpi-4.0.0/examples$ mpirun -n 2 ./hello
Hello, world, I am 1 of 2, (Open MPI v4.0.0, package: Open MPI mpiuser#s1 Distribution, ident: 4.0.0, repo rev: v4.0.0, Nov 12, 2018, 112)
Hello, world, I am 0 of 2, (Open MPI v4.0.0, package: Open MPI mpiuser#s1 Distribution, ident: 4.0.0, repo rev: v4.0.0, Nov 12, 2018, 112)
while running command separately in s2(slave)
mpiuser#s2:~/cloud$ mpirun -n 2 ./hello
Hello, world, I am 0 of 2, (Open MPI v4.0.0, package: Open MPI mpiuser#s2 Distribution, ident: 4.0.0, repo rev: v4.0.0, Nov 12, 2018, 113)
Hello, world, I am 1 of 2, (Open MPI v4.0.0, package: Open MPI mpiuser#s2 Distribution, ident: 4.0.0, repo rev: v4.0.0, Nov 12, 2018, 113)
Output of hwloc command on s2:
mpiuser#s2:~/cloud/openmpi-4.0.0$ dpkg -l | grep hwloc
mpiuser#s2:~/cloud/openmpi-4.0.0$
Output of hwloc command on s1:
mpiuser#s1:/disk3/cloud/openmpi-4.0.0/examples$ dpkg -l | grep hwloc
mpiuser#s1:/disk3/cloud/openmpi-4.0.0/examples$
Both machines are running on Ubuntu 16.04.5 LTS
but while running command on distributed giving following error
mpiuser#s1:/disk3/cloud/openmpi-4.0.0/examples$ mpirun -host s1,s2 ./hello
[s2:26283] [[40517,0],1] ORTE_ERROR_LOG: Data unpack would read past end of buffer in file grpcomm_direct.c at line 355
--------------------------------------------------------------------------
An internal error has occurred in ORTE:
[[40517,0],1] FORCE-TERMINATE AT Data unpack would read past end of buffer:-26 - error grpcomm_direct.c(359)
This is something that should be reported to the developers.
--------------------------------------------------------------------------

Please see this post as answer. The problem may come from a missing link from the zlib libary which is used to compress data from one host to another. Please make sure zlib.h is in /usr/include. If not: do sudo apt install zlib1g-dev and then reinstall entirely mpi through configure, make and make install.

Related

yum update showing package version which is already installed

My system has a libmaxminddb-1.2.0-10.el8.rpm package which is already installed. But when I run yum update --assumeno, the same package is asking to be reinstalled again, any idea why is this happening and how can I avoid it. Thanks
[root#vm x86_64]# ls | grep libmax
libmaxminddb-1.2.0-10.el8.rpm
[root#vm~]# yum update --assumeno
Repository 'My-Repository' is missing name in configuration, using id.
Repository 'updates' is missing name in configuration, using id.
Last metadata expiration check: 0:14:11 ago on Thu 04 Feb 2021 06:16:35 PM UTC.
Dependencies resolved.
=====================================================================================================================================================================================================
Package Architecture Version Repository Size
=====================================================================================================================================================================================================
Upgrading:
libmaxminddb x86_64 1.2.0-10.el8 My-Repository 33 k
Transaction Summary
=====================================================================================================================================================================================================
Upgrade 1 Package
Total download size: 33 k
Operation aborted.
Maybe the new package have same name, version, release and arch, but different epoch (which would be pretty ugly from packaging point of view)? Try to compare what you have and what is available.
To see details on package you have installed (in this example I'm using bash):
# rpm -q --queryformat "Epoch: %{EPOCH}\n" bash
Epoch: (none)
# rpm -q --info bash
Name : bash
Version : 5.0.17
Release : 2.fc33
Architecture: x86_64
Install Date: Tue 20 Oct 2020 01:36:17 AM CEST
Group : Unspecified
Size : 7709818
License : GPLv3+
Signature : RSA/SHA256, Tue 28 Jul 2020 12:10:09 AM CEST, Key ID 49fd77499570ff31
Source RPM : bash-5.0.17-2.fc33.src.rpm
Build Date : Mon 27 Jul 2020 03:17:35 PM CEST
Build Host : buildhw-x86-14.iad2.fedoraproject.org
Packager : Fedora Project
Vendor : Fedora Project
URL : https://www.gnu.org/software/bash
Bug URL : https://bugz.fedoraproject.org/bash
Summary : The GNU Bourne Again shell
Description :
The GNU Bourne Again shell (Bash) is a shell or command language
interpreter that is compatible with the Bourne shell (sh). Bash
incorporates useful features from the Korn shell (ksh) and the C shell
(csh). Most sh scripts can be run by bash without modification.
And then get same info on available package:
# yum download bash
# rpm -qp bash-5.0.17-2.fc33.x86_64.rpm --queryformat "Epoch: %{EPOCH}\n"
Epoch: (none)
# rpm -qp bash-5.0.17-2.fc33.x86_64.rpm --info
Name : bash
Version : 5.0.17
Release : 2.fc33
Architecture: x86_64
Install Date: (not installed)
Group : Unspecified
Size : 7709818
License : GPLv3+
Signature : RSA/SHA256, Tue 28 Jul 2020 12:10:09 AM CEST, Key ID 49fd77499570ff31
Source RPM : bash-5.0.17-2.fc33.src.rpm
Build Date : Mon 27 Jul 2020 03:17:35 PM CEST
Build Host : buildhw-x86-14.iad2.fedoraproject.org
Packager : Fedora Project
Vendor : Fedora Project
URL : https://www.gnu.org/software/bash
Bug URL : https://bugz.fedoraproject.org/bash
Summary : The GNU Bourne Again shell
Description :
The GNU Bourne Again shell (Bash) is a shell or command language
interpreter that is compatible with the Bourne shell (sh). Bash
incorporates useful features from the Korn shell (ksh) and the C shell
(csh). Most sh scripts can be run by bash without modification.

jenkins - phantomJS functional tests failing

I have downloaded latest version of below phantomJS driver for headless testing of my selenium functional tests in Jenkins CI.
phantomjs-2.1.1-linux-x86_64/bin/phantomjs
I get the below error message in Jenkins console log. Any idea what might be going on? I am suspecting something related to the version incompatibility but am not quite sure.
Running TestSuite
Nov 13, 2016 9:25:28 PM org.openqa.selenium.phantomjs.PhantomJSDriverService <init>
INFO: executable: /softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs
Nov 13, 2016 9:25:28 PM org.openqa.selenium.phantomjs.PhantomJSDriverService <init>
INFO: port: 19175
Nov 13, 2016 9:25:28 PM org.openqa.selenium.phantomjs.PhantomJSDriverService <init>
INFO: arguments: [--webdriver=19175, --webdriver-logfile=/softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/phantomjsdriver.log]
Nov 13, 2016 9:25:28 PM org.openqa.selenium.phantomjs.PhantomJSDriverService <init>
INFO: environment: {}
/softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs: /lib64/libz.so.1: no version information available (required by /softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs)
/softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs: /usr/lib64/libstdc++.so.6: version `GLIBCXX_3.4.9' not found (required by /softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs)
/softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs: /usr/lib64/libstdc++.so.6: version `GLIBCXX_3.4.11' not found (required by /softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs)
/softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs: /lib64/libc.so.6: version `GLIBC_2.9' not found (required by /softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs)
/softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs: /lib64/libc.so.6: version `GLIBC_2.10' not found (required by /softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs)
/softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs: /lib64/libc.so.6: version `GLIBC_2.7' not found (required by /softwares/build.tool/.jenkins/jobs/SELENIUM_UI/workspace/resources/phantomjs-2.1.1-linux-x86_64/bin/phantomjs)
Nov 13, 2016 9:25:49 PM org.openqa.selenium.os.UnixProcess checkForError
SEVERE: org.apache.commons.exec.ExecuteException: Process exited with an error: 1 (Exit value: 1)
Tests run: 7, Failures: 1, Errors: 0, Skipped: 6, Time elapsed: 22.943 sec <<< FAILURE! - in TestSuite
pageSetup(com.visualiq.automation.selfservice.ClientSetupTest) Time elapsed: 22.748 sec <<< FAILURE!
org.openqa.selenium.WebDriverException:
Timed out waiting for driver server to start.
Build info: version: '3.0.1', revision: '1969d75', time: '2016-10-18 09:49:13 -0700'
System info: host: 'hnet11234.inetservices.com', ip: '10.300.14.86', os.name: 'Linux', os.arch: 'amd64', os.version: '2.6.18-371.1.2.el5', java.version: '1.8.0_101'
Driver info: driver.version: PhantomJSDriver
All, I am able to solve the issue by using phantomJS version 1.9.8 instead of version 2.1.1. The latest version 2.1.1 has some compatibility issues with the version of Red Hat Linux my Jenkins is running on.
Jenkins Box:
Linux inet11086.inetservices.com 2.6.18-371.1.2.el5 #1 SMP Mon Oct 7 16:34:35 EDT 2013 x86_64 x86_64 x86_64 GNU/Linux
LSB Version: :core-4.0-amd64:core-4.0-ia32:core-4.0-noarch:graphics-4.0-amd64:graphics-4.0-ia32:graphics-4.0-noarch:printing-4.0-amd64:printing-4.0-ia32:printing-4.0-noarch
Distributor ID: RedHatEnterpriseServer
Another option to go with was to update libraries in Red Hat box but I went with using downgraded version of phantomJS. Please let me know if you need more info on this!

QWidget - Multi Monitor - Single Desktop - X_CopyArea Bug under Ubuntu

I can't figure out what's going on with this Bug i got:
X Error: BadMatch (invalid parameter attributes) 8
Major opcode: 62 (X_CopyArea)
Resource id: 0x4e00004
I wanted to display two widgets on two Monitors with one Application.
I configured my Target with the following screen Configuration (Single Desktop, Multiple Monitors):
Screenshot
Weird is the fact that it works for an old X-Server under Suse. Under Ubuntu it doesn't.
Here are source and X-Versions
Btw:
I must use this config (there must be no virtual Desktop on two screens), and i just want to get why it doesn't work.
- Same dirver
- so qt-bug?
source.zip can be downloaded here
#include <qapplication.h>
#include <qwidget.h>
#include <qdesktopwidget.h>
int
main(int argc, char *argv[])
{
QApplication app(argc, argv);
QWidget widgetScreenZero(qApp->desktop()->screen(0));
QWidget widgetScreenOne(qApp->desktop()->screen(1));
widgetScreenZero.setStyleSheet("background-image: url(:/images/zero.svg);");l
widgetScreenOne.setStyleSheet("background-image: url(:/images/one.svg);");
widgetScreenOne.show();
widgetScreenZero.show();
return app.exec();
}
+
Suse:
X.Org X Server 1.8.0
Release Date: 2010-04-02
X Protocol Version 11, Revision 0
Build Operating System: openSUSE SUSE LINUX
Current Operating System: Linux ws17 2.6.34-12-desktop #1 SMP PREEMPT 2010-06-29 02:39:08 +0200 i686
Kernel command line: root=/dev/disk/by-id/ata-KINGSTON_SV100S264G_64GB60076281-part1 resume=/dev/disk/by-id/ata-KINGSTON_SV100S264G_64GB60076281-part2 splash=silent quiet crashkernel= vga=0x345
Build Date: 05 July 2010 09:26:26PM
Current version of pixman: 0.18.0
Before reporting problems, check http://wiki.x.org
to make sure that you have the latest version.
+
Ubuntu:
X.Org X Server 1.14.5
Release Date: 2013-12-12
X Protocol Version 11, Revision 0
Build Operating System: Linux 2.6.42-37-generic i686 Ubuntu
Current Operating System: Linux ws25 3.11.0-15-generic #25~precise1-Ubuntu SMP Thu Jan 30 17:42:40 UTC 2014 i686
Kernel command line: BOOT_IMAGE=/boot/vmlinuz-3.11.0-15-generic root=UUID=ce61481b-be95-4a6b-a5db-1f154dc14cd6 ro quiet splash vt.handoff=7
Build Date: 06 January 2014 01:41:06PM
xorg-server 2:1.14.5-1ubuntu2~saucy1~precise2 (For technical support please see http://www.ubuntu.com/support)
Current version of pixman: 0.30.2
Before reporting problems, check http://wiki.x.org
to make sure that you have the latest version.

RPM spec to require specific RHEL release

In an RPM spec file, what is a good way of requiring a minimum RHEL distribuition? I tried to make RHEL 6.3 a prerequisite using:
Requires: redhat-release-server-6Server >= 6.3
This blocks the install on 6.2 and 6.3 with:
error: Failed dependencies:
redhat-release-server-6Server >= 6.3 is needed by my.package-1.2.0-0.x86_64
I also tried:
Requires: redhat-release >= 6.3
which did not fail the dependecy check on either 6.2 or 6.3.
The redhat-release-server-6Server-6.3.0.3.el6.x86_64 (on my system) seems to be a good candidate as a prerequisite. See http://rhn.redhat.com/errata/RHEA-2012-0971.html . What is not obvious for a novice looking at the package name is that "6Server" is actually the package version. "rpm -q --info" make this clear, though:
$ rpm -q redhat-release-server-6Server --info
Name : redhat-release-server Relocations: (not relocatable)
Version : 6Server Vendor: Red Hat, Inc.
Release : 6.3.0.3.el6 Build Date: Wed 30 May 2012 11:19:03 AM PDT
Install Date: Tue 02 Oct 2012 09:48:26 AM PDT Build Host: x86-003.build.bos.redhat.com
Group : System Environment/Base Source RPM: redhat-release-server-6Server-6.3.0.3.el6.src.rpm
Size : 38585 License: GPLv2
Signature : RSA/8, Wed 30 May 2012 12:19:55 PM PDT, Key ID 199e2f91fd431d51
Packager : Red Hat, Inc. <http://bugzilla.redhat.com/bugzilla>
Summary : Red Hat Enterprise Linux Server release file
Description :
Red Hat Enterprise Linux Server release files
So the correct directive is:
Requires: redhat-release-server >= 6Server-6.3
I would suggest that instead of required specific version of RHEL, Require the dependency that only appear in the target version (e.g. RHEL 6.3)
Following not only cause problem you mentioned, but also block the installation for the client and workstation users (6Client and 6Workstation)
Requires: redhat-release-server-6Server >= 6.3

libstdc++.so.6: cannot handle TLS data

I have an application compiled at:
gcc version 4.1.2 20061115 (prerelease) (Debian 4.1.1-21)
Linux debian 2.6.18-5-686 #1 SMP Fri Jun 1 00:47:00 UTC 2007 i686 GNU/Linux
and it runs well.
Now I want to run it at:
Linux 2.4.20_mvlcge31-tomas #7 Thu May 7 11:33:21 CEST 2009 i686 unknown
I got following errors:
libstdc++.so.6: cannot handle TLS data
From the web I saw someone suggested to do this: export LD_ASSUME_KERNEL=2.2.5
I tried but get even more errors:
ls: error while loading shared libraries: librt.so.1: cannot open shared object file: No such file or directory
Who can help me with it? thanks
You had compiled the application against much newer libc and kernel version, You can't compile program on 2.6 with newest libc and expect it to run on old kernel.
Also where do you actually still use Linux 2.4?

Resources