Delivery to xlate thread failed. Requeuing msg.

This topic has 8 replies, 3 voices, and was last updated 8 years, 4 months ago by Bob Schmid.

Creator

Topic
August 24, 2016 at 2:24 pm #55172
Kevin Scantlan
Participant
We are on Cloverleaf version 6.1.2 with AIX 6.1.

Viewing 7 reply threads

Author

Replies
- August 25, 2016 at 3:54 pm #84451
  Rob Abbott
  Keymaster
  So a protocol thread is unable to deliver to the xlate thread within the same process? That is strange.
  
  Is the process name unusually long, or does it have special characters?
  
  Are you able to contact the xlate thread when this is going on (e.g. hcicmd -p -c “_xlate purgex”)?
  
  I would suggest you open a case with Support through Xtreme to get this resolved and determine if anyone else has seen this (I haven’t).
  
  We would certainly hate to see you back out an upgrade!
  
  Rob Abbott
  Cloverleaf Emeritus
- August 25, 2016 at 4:33 pm #84452
  Kevin Scantlan
  Participant
  We have a case open. My guess is that, for some reason, version 6.1.2 is not handling an abort correctly and is corrupting the ICL thread. We start getting the “Delivery” messages when we bring the process back up. When we clean up the ICL, they go away.
- August 26, 2016 at 4:34 pm #84453
  Kevin Scantlan
  Participant
  Rob,
  
  I did not try to run an hcicmd command, but I presume that when we brought the process back up and the threads came back up, it was doing a hcicmd -p -c “ pstart” in the background.
  
  We had not made any changes from version 5.8.5 to 6.1.2 in the NetConfig, the xlates, or the tclprocs.
  
  We’ve been told that this has been submitted to R&D, but I’ve not heard anything from them. Would it be possible to follow up on this? It has not happened again, but it’s a time bomb and we don’t know when it will go off.
- August 29, 2016 at 2:42 pm #84454
  Rob Abbott
  Keymaster
  Kevin, if you haven’t heard back on your Xtreme case, you can ask for an update through Xtreme or give them a call. We really need to track issues like this through the support system. Thanks!
  
  Rob Abbott
  Cloverleaf Emeritus
- August 29, 2016 at 3:22 pm #84455
  Kevin Scantlan
  Participant
  Does anyone know of a way to force an abort in a process? Our issue occurs after a process aborts, but we’ve not had an abort for over a week (which is good). It seems the only way we can tell what is going on is to “catch it in the act”. I believe the issue is not what’s causing the abort, but how the engine handles the abort. We don’t want to turn logging up and just wait for it to happen. We want to be able to force the issue on the test site, which resides on the same server, and turn on the logging there.
  
  Thanks.
- August 29, 2016 at 4:39 pm #84456
  Rob Abbott
  Keymaster
  If you’re on unix you could kill it with a SIGSEGV signal – e.g.
  
  kill -11
  
  This should cause a panic. If you use -9 then the process will die “dirty” without a panic.
  
  On Windows you would need to end the process from task manager.
  
  Rob Abbott
  Cloverleaf Emeritus
- August 30, 2016 at 2:44 pm #84457
  Kevin Scantlan
  Participant
  Thanks, Rob! I used your information and was able to recreate the error! I did a kill -11 on a process and it PANIC’ed that process. I brought the process back up and resent a single message into an inbound thread that is in the process. The message got “stuck” in state 2. I also got the following in the process log:
  
  [tcl :out :INFO/0: ccs_ci_4:08/30/2016 09:17:46] 160830 091746.1472566666 ccs_ci_4 CCSED DFT^P03 20160829110320 20160829110320_001 67-99-39-6() 28368899() LEIVA^ELIZABETH^
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [cmd :cmd :INFO/0:test1_ps04_cmd:08/30/2016 09:17:46] Receiving a command
  
  [cmd :cmd :INFO/0:test1_ps04_cmd:08/30/2016 09:17:46] Command client went away. Closing connection.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  [pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.
  
  I have been told that there is a “fix” (6.1.2.1). So once I install that fix, I can recreate the same scenario and see if it is indeed fixed.
  
  Thanks for your help. It appears we are on our way.
- April 3, 2017 at 1:32 pm #84458
  Bob Schmid
  Participant
  Might I have a status on this error? We are impacted as well.
  
  Running 6.1.2 on AIX.
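
For anyone trying to confirm the same symptom after a forced panic, a small script can show whether one message is being requeued over and over, as in Kevin's log excerpt above. The following is only a sketch: the regular expression is inferred from the log lines quoted in this thread (other versions or sites may format them differently), the bracketed value is assumed to be the message ID, and the function names and the threshold are hypothetical, not part of Cloverleaf.

```python
import re
from collections import Counter

# Pattern inferred from the log excerpt in this thread (assumption):
#   [pd :pdtd:ERR /0: ...] [0.0.108735679] Delivery to xlate thread NAME failed. Requeuing msg.
# The bracketed dotted number is assumed to be the message ID.
REQUEUE_RE = re.compile(
    r"\[(?P<mid>\d+\.\d+\.\d+)\]\s+Delivery to xlate thread "
    r"(?P<xlate>\S+) failed\. Requeuing msg\."
)


def count_requeues(lines):
    """Count requeue errors per (xlate thread, message id) pair."""
    counts = Counter()
    for line in lines:
        match = REQUEUE_RE.search(line)
        if match:
            counts[(match.group("xlate"), match.group("mid"))] += 1
    return counts


def stuck_messages(lines, threshold=5):
    """Return the pairs whose requeue count is at least `threshold`."""
    return {
        key: n for key, n in count_requeues(lines).items() if n >= threshold
    }


if __name__ == "__main__":
    # Two sample lines copied from the log excerpt in this thread.
    sample = [
        "[pd :pdtd:ERR /0: ccs_ci_4:08/30/2016 09:17:46] [0.0.108735679] "
        "Delivery to xlate thread test1_ps04_xlate failed. Requeuing msg.",
        "[cmd :cmd :INFO/0:test1_ps04_cmd:08/30/2016 09:17:46] Receiving a command",
    ]
    for (xlate, mid), n in count_requeues(sample).items():
        print(f"{xlate} message {mid}: {n} requeue error(s)")
```

To use it on a real log, pass an open file object to `stuck_messages` in place of the sample list. A single message ID with a high count against one xlate thread would match the "stuck in state 2" behavior described above.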

The forum ‘Cloverleaf’ is closed to new topics and replies.