Over the last 6 months, we’ve been running problems where the tentacle service on some machines stops running. Looking at the logs doesn’t show any issues, so I’m not sure why it’s fairly. I will continue to look into it, but I need to way to mitigate the problem right now. Starting the Octopus Tentacle service always fixes the issue.
Is there some way to have the Octopus Service run a script when a machine fails a health check? I can easily write a script to make sure that the service is running, but there isn’t an obvious place to drive the script. What would be perfect is a way to get a list of the failing servers, and then run a script on either the list or each server. A health check failure action if you will.
Thanks for getting in touch. We have a watchdog command that creates a Windows Scheduled Task that you can make use of to help in this scenario. We’ve put together a step by step walk through for setting up watchdog: https://octopus.com/docs/administration/service-watchdog
We usually suggest watchdog for scenarios relating to machine reboot, but it sounds the issue you’re experiencing could also benefit.
Let us know if you still have issues and we can investigate further.