A couple of weeks ago we experienced a hardware failure in our network that caused my Cloverleaf cluster to lose connection. The recovery to bring my Cloverleaf interfaces back online was about 4 hours. Part of that was trying to determine why I could see the cluster, but couldn’t get the Cloverleaf Gui to start. Once I stopped and started my cluster services on my AIX box, I was able to get in. At that point, due to the hard crash, databases were corrupted and messages were hung, when had to be dropped to a file to try to minimize loss of data.
My question today is, what steps could I have taken to minimize the downtime in this situation?
Here is info about my server:
26 – Sites
121 – Processes
AIX 6.1.0.0
Cloverleaf 6.0.2.0
Any help is greatly appreciated.
Thanks,
Gina