CLOVERLEAF(R) Integration Services 5.8.5.3P
The details are below. Has anyone seen this before?
Firstly, we may have issues that very few other sites have!
Our production server runs at 100% at peak, with a load average that occasionally exceeds 20 (not our choice!).
When this issue happened, the server was heavily loaded and both the Cloverleaf thread and the receiving connection were being repeatedly bounced.
I have executed the unit test script with various modifications for our resend code and have been unable to re-create the issue.
We have production sites running Cloverleaf versions 5.4, 5.6 and 5.8. All of them use the same ‘resend’ code, which has been in use for many years.
In the ‘resend’ code, when a message has been sent, no ACK has been received, and the Cloverleaf thread has been stopped and restarted, we ‘trap’ the message in the ‘reply_gen’ context and log it, then log the ‘send_data_ok’ context for the message delivery.
In the production logs, the message still exists on thread shutdown, as expected:
[cmd :cmd :INFO/0:snk_prod_ih_cmd:07/16/2012 12:00:36] Doing ‘pstop’ command on thread ‘pas_prod_ih_adt_snd’
12:00:36 pas_prod_ih_adt_snd gen_code_resend:INFO :
12:00:36 pas_prod_ih_adt_snd gen_code_msginfo:INFO : Thread is shutting down
12:00:36 pas_prod_ih_adt_snd gen_code_resend:INFO :
12:00:36 pas_prod_ih_adt_snd gen_code_resend:INFO :
12:00:36 pas_prod_ih_adt_snd gen_code_resend:INFO :
12:00:36 pas_prod_ih_adt_snd gen_code_printmsg:INFO : Thread is shutting down
12:00:36 pas_prod_ih_adt_snd gen_code_resend:INFO :
[prod:prod:INFO/0:pas_prod_ih_adt_snd:07/16/2012 12:00:36] Checking for leaked handles in the General interpreter…
[prod:prod:INFO/0:pas_prod_ih_adt_snd:07/16/2012 12:00:36] Checking for leaked handles in the TPS interpreter…
Handle Allocated by
====== ============
message0
WARNING: Message [0.0.647402] is in the RDB and was left bound into Tcl
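For anyone wanting to check their own logs for the same symptom, here is a rough Python sketch (not part of our Cloverleaf code) that pulls the message IDs out of the leaked-handle warning shown above. The function name and the regular expression are assumptions based only on the single warning line in this excerpt; other Cloverleaf versions may word the warning differently.

```python
import re

# Pattern taken from the one warning line in the excerpt above (an assumption:
# the wording may differ on other Cloverleaf versions).
BOUND_MSG = re.compile(
    r"WARNING: Message \[([0-9.]+)\] is in the RDB and was left bound into Tcl"
)


def find_bound_messages(lines):
    """Return the message IDs reported as still bound into Tcl at shutdown."""
    ids = []
    for line in lines:
        match = BOUND_MSG.search(line)
        if match:
            ids.append(match.group(1))
    return ids


# Two lines copied from the shutdown log above.
sample = [
    "message0",
    "WARNING: Message [0.0.647402] is in the RDB and was left bound into Tcl",
]
print(find_bound_messages(sample))  # prints ['0.0.647402']
```

The IDs it returns could then be searched for in the log after the next ‘pstart’ to see whether the resend code ever picked the message up again.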
On thread start-up, however, the message in ‘state 14’ is not trapped:
[cmd :cmd :INFO/0:snk_prod_ih_cmd:07/16/2012 12:00:37] Doing ‘pstart’ command on ‘pas_prod_ih_adt_snd’
[prod:prod:INFO/0:pas_prod_ih_adt_snd:07/16/2012 12:00:37] Starting protocol thread pas_prod_ih_adt_snd as tid 4.
[prod:prod:INFO/0:pas_prod_ih_adt_snd:07/16/2012 12:00:42] Applying EO config: ”
12:00:45 pas_prod_ih_adt_snd gen_code_resend:INFO : RCS Info $Id: gen_code_resend.tcl,v 1.27 2012/05/10 09:05:29 he00387 Exp $
…
12:00:46 pas_prod_ih_adt_snd gen_code_resend:INFO :
Compare this with the unit test on thread start-up:
[cmd :cmd :INFO/0:he00387_snd_cmd:07/23/2012 13:37:02] Doing ‘pstart’ command on ‘gen_code_ai_xxx_snd’
[prod:prod:INFO/0:gen_code_ai_xxx_snd:07/23/2012 13:37:03] Applying EO config: ”
13:37:03 gen_code_ai_xxx_snd gen_code_resend:INFO : RCS Info $Id: gen_code_resend.tcl,v 1.27 2012/05/10 09:05:29 he00387 Exp $
…
13:37:03 gen_code_ai_xxx_snd gen_code_resend:INFO :
13:37:03 gen_code_ai_xxx_snd gen_code_resend:INFO :