This morning we had an issue where a process was stopped from the Network Monitor GUI and could not be restarted. One of our SysAdmins (who normally doesn’t work with Cloverleaf) identified a PID that was still running and killed it so the process could be restarted.
I’m trying to get a little smarter on how this works and become more self-sufficient, so please forgive the wordiness!
When I run ps -ef | grep for the process, I see two entries:
— an ‘hciengine’ entry followed by the site, process and thread details, and
— an ‘hcienginewatch’ perl script entry with the same details.
Based on the entry names, I take it that the former is the actual interface and the latter controls the GUI display.
When the process was stopped this morning, the GUI showed the process block red and the thread boxes as dead. I take this to mean that ‘hcienginewatch’ stopped successfully but ‘hciengine’ was still running, so it wouldn’t allow a restart. There were no readily apparent errors we could identify.
So to bring down the ‘hciengine’ PID, we ran kill PID, then ran ps -ef again to make sure it was gone, and then the process started normally from the GUI.
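For reference, here is roughly how I would sketch those steps as a small shell helper. This is only my own illustration, not anything shipped with Cloverleaf: the function name engine_pids and the process name my_process are made up, and I am assuming the PID is in the second column of ps -ef and that the process name shows up somewhere on the ‘hciengine’ line.

```shell
#!/bin/sh
# Hypothetical helper (not a Cloverleaf tool): reads `ps -ef` output on stdin
# and prints the PID (assumed to be column 2) of every line that mentions
# hciengine and the given process name, skipping hcienginewatch lines.
engine_pids() {
    # $1 = process name to look for (placeholder supplied by the caller)
    awk -v name="$1" '
        /hcienginewatch/ { next }                      # ignore the watch script
        /hciengine/ && index($0, name) { print $2 }    # engine line for this process
    '
}

# Usage sketch, left commented out so nothing is killed by accident:
# ps -ef | engine_pids my_process    # list leftover engine PIDs
# kill <PID>                         # default signal is SIGTERM
# ps -ef | engine_pids my_process    # should print nothing once it is gone
```

The idea is just to separate the real engine entry from the watch script entry before deciding which PID to kill.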
If all that sounds right, should there be any other ‘housekeeping’ steps to clean this up, or other things we can check that might lead us to a cause or prevent future problems? If I’m not on the right track, let me know that too.
Thanks in advance!!