Hello Clovertech and Friends,
I think I’ve got an OS issue and I need to bend the ear of those with Redhat/Linux knowledge.
We have cloverleaf 19.1.2 setup in a high availability configuration. Just on our second node, every two weeks — nearly exactly, All Cloverleaf processes will panic. When we run Cloverleaf on Node 1, this does not happen. The process log indicates the following:
<pre>[pti :sele:WARN/0: proc1_cmd:09/13/2023 16:41:12] Select returns -1 4: Interrupted system call
[dbi :dbi :ERR /0:proc1_xlate:09/13/2023 16:41:12] (-925) ‘RDM Embedded DB error: “LMC error: -925
[dbi :dbi :ERR /0:proc1_xlate:–/–/—- –:–:–] Lock manager communication error
[dbi :dbi :ERR /0:proc1_xlate:–/–/—- –:–:–] C errno = 32: Broken pipe”
[dbi :dbi :ERR /0:proc1_xlate:–/–/—- –:–:–] ‘
PANIC: “(errnum > -900) || (errnum < -976)”
PANIC: Calling “pti” for thread proc1_cmd
—– Scheduler State —–</pre>
I’ve reviewed many system logs, but can’t find anything interesting. The journalctl.log doesn’t seem to indicate any ‘smoking guns’. I don’t believe it’s an HA issues.
Anyone know what else I might be able to check?
Current Configuration:
OS: Red Hat Enterprise Linux release 8.8 (Ootpa)
Running on VMWare.
Cloverleaf: 19.1.2.1P High Availability.
- Jared Parish