Cloverleaf 5.8 rev3 – monitor daemon issues

Clovertech Forums Read Only Archives Cloverleaf Cloverleaf Cloverleaf 5.8 rev3 – monitor daemon issues

  • Creator
    Topic
  • #52587
    Jerry Magrann
    Participant

      We are running 5.8-3 on Linux 5.3 and have been having issues where the monitor daemon becomes unresponsive every day or so and starts cutting off Cron and TCP connections. When we try to stop and start the monitor daemon, it does not always come down gracefully (screen shot attached).

    Viewing 6 reply threads
    • Author
      Replies
      • #74781
        Jim Rawls
        Participant

          Sounds like a file permissions issue; was there any more information in the process log?

        • #74782
          Jerry Magrann
          Participant

            We did get this message – “write to fifo failed” which our led our system admin to look at the number of files that our HCI user could have open which was the default of 1024. This 5.8 is in test right now in hopes to bring it live in a couple weeks, so we compared our ulimit on our old system and it was double the default, so we upped it on the 5.8 system and are monitoring in hopes this is the fix.

          • #74783
            Anonymous
            Participant

              We to are having issues almost daily, I come in, open the IDE and all sites appear dead.

              I run hciss and hcisitectl and they both time out. I then bounce monitor daemon and things just start working. This happened shortly after the rev3 patch was installed.

            • #74784
              Russ Ross
              Participant

                Jerry in the document for installing cloverleaf there is usually suggested values for setting limits.

                Russ Ross
                RussRoss318@gmail.com

              • #74785
                Jerry Magrann
                Participant

                  Since we up’d the limit of the number of files our hci user can have opened from 1024 to almost 8000, we have had no issues with our monitor daemon and the app seems to be running better as a whole. We are planning to move the 5.8-4 build to production this coming Sunday where we we will really put it to the test with volume and such. Will let you all know if we have any issues in prod.

                • #74786
                  Anonymous
                  Participant

                    What command do you use to get the values below?

                    hci:

                          fsize = 2097151

                          core = 2097151

                          cpu = -1

                          data = 3145728

                          rss = 524288

                          stack = 524288

                          nofiles = 10000

                  • #74787
                    James Cobane
                    Participant

                      ulimit -a

                      Or you can look at the /etc/security/limits file directly….

                      Jim Cobane

                      Henry Ford Health

                  Viewing 6 reply threads
                  • The forum ‘Cloverleaf’ is closed to new topics and replies.