If you delete the outbound thread, Cloverleaf is supposed to delete the routes as well. However, I deleted two routes in November, and about a month later, our engine crashed! After several hours of troubleshooting, we found that our source thread was still trying to send to the two outbound threads that I had deleted. All I did was delete the outbound threads and clicked “OK” when Cloverleaf informed me that it would delete the routes associated with the threads. I then bounced the process those threads were part of and verified they actually deleted from the network monitor.
During the time our engine crashed, the error logs kept showing that messages were trying to be sent to those two threads that I deleted a month prior. So basically, every message that hit the source thread from the time I deleted the two threads was trying to be routed to those deleted threads (the route was defined to send EVERYTHING). And this was coming from an ADT feed, so the volume was very high. I ended up having to rebuild those threads so I could search the recovery database. Sure enough, there were over a million messages set to go out to those threads. WHen the process was bounced, the attempt to route all of those messages was just CRUSHING the CPU on the server, and eventually, the process would crash. We ended up having to move all of the threads in that process to a new process. When we did that, everything came back up as expected, so it’s like cloverleaf didn’t fully delete those routes, and that original process still had some sort of configuration pointing at the deleted threads. I got all of those messages deleted out of the recovery database, restarted our engine and everything was back to normal.
I was just wondering if deleting the outbound thread is the recommended way to delete the route as well. For now, we go into the source thread and delete the routes completely before removing the threads to prevent this from happening again. Has anybody else ever seen this? Is this a possible Cloverleaf 5.8 bug?
We run a Fedora box with Cloverleaf 5.8. We have 1 site with about 150 threads. We also (for the most part) do everything with TCL procs, we don’t use many translations.
Thanks.