Our health checks and deployments to certain target servers have started failing today, and I have been unable to determine a cause. It is specifically the servers to which Octopus should connect using SSH - all of them.
I’ve checked that we can connect to those servers ourselves, and I’ve checked the IP address whitelists to ensure that Octopus should be granted access, and yet all attempts to have Octopus connect to these servers has failed.
The failures seems to have started between about 5 and 21 hours ago - sometime overnight, our time.
Any advice on how we can more precisely determine the cause of the failure? Is there anything going on at the Octopus end that we should know about, that might be a factor?
I can pull the call stack of the failure out of the logs, but it doesn’t seem to say anything very interesting. We already know that the SSH simply cannot find the target.