Await replies checked
Timeout = 60
on timeout Resend OB message selected
TPS IB Reply = hcitpsmsgkill
We occasionally have a problem with one of our outbound threads: it starts resending the same message over and over again. I can see that we received one ACK for it. It seems that the receiving system won’t send another ACK if it has already sent one for the same message.
This happens only on one particular outbound thread and not all the time. Some days are better, some worse.
When it happens, we have to stop the sending thread and delete that message (state 16) from the recovery database. After that, a few messages usually go through and then the same thing happens again.
We are getting desperate. Today two of us (interface programmers) spent the whole day doing just that: stopping the thread, deleting the message, and starting the thread again. Needless to say, this caused our recovery database to grow and the engine to panic.
I am thinking about writing a procedure that counts the number of resends (similar to what recover_33 used to do), then checks the ACK file (we save inbound replies to a file) and, if the last ACK received is for the message we keep sending, kills the message.
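The check I have in mind would look roughly like the sketch below. It is plain Python meant only to show the logic, not engine code. It assumes the messages are HL7 v2, so that MSA-2 of the ACK echoes MSH-10 of the original message, and the names MAX_RESENDS, message_control_id, acked_control_id and should_kill are made up for illustration. How to keep the resend count and hook this into the thread is not covered.

```python
# Hypothetical threshold: how many resends to tolerate before checking the ACK file.
MAX_RESENDS = 3


def segments(message):
    """Split an HL7 v2 message into segments (CR-delimited; LF is tolerated too)."""
    return [s for s in message.replace("\n", "\r").split("\r") if s]


def field(segment, index, sep="|"):
    """Return the field at the given split index, or "" if the segment is too short."""
    parts = segment.split(sep)
    return parts[index] if index < len(parts) else ""


def message_control_id(message):
    """Return MSH-10. MSH-1 is the separator itself, so MSH-10 is split index 9."""
    for seg in segments(message):
        if seg.startswith("MSH"):
            return field(seg, 9)
    return ""


def acked_control_id(ack):
    """Return MSA-2, the control ID of the message this ACK answers."""
    for seg in segments(ack):
        if seg.startswith("MSA"):
            return field(seg, 2)
    return ""


def should_kill(outbound, resend_count, last_ack):
    """True if the message has been resent enough and the last saved ACK is for it."""
    if resend_count < MAX_RESENDS:
        return False
    control_id = message_control_id(outbound)
    return control_id != "" and control_id == acked_control_id(last_ack)
```

Here last_ack would be the last reply read from our saved ACK file, and resend_count would have to be tracked somewhere between resends.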
The problem is I can’t figure out how to do it.
Any ideas? Or any other suggestions?
Thanks,
Sergey