Tentacle update seems to hang in server ui during update after server uprade from 3.3.19 to v3.17.2

We recently updated our Octopus Server to version 3.17.2 from 3.3.19 and we are running into an issue upgrading the tentacles that I am wondering if anyone else has run into.

After completing the server upgrade we logged into the application and found (as expected) that the tentacles needed to be upgraded. We proceeded by clicking the [Upgrade these 9 machines] button in our development environment which immediately began the upgrade on 4 of the 9 machines in that environment. That process appeared to hang waiting for the tentacles to restart. It was the end of the day so we let it run overnight to see if it would complete normally on its own given enough time. Upon coming into the office in the morning we found that they were still waiting (It had been running for over 16 hours at that point). We logged into the machines in question and verified the services were indeed running and restarted the server service to see if it would pick up the fact that they had indeed restarted. It did immediately recognize them as restarted and began processing the next four, of which two more hung in the same state. After some digging and looking into the tentacle logs we found what appear to be an error in communication between the server and the tentacle:

2017-09-27 10:06:25.5753 3200 6 INFO The Windows Service has started
2017-09-27 10:06:25.6534 3200 3 INFO listen://[::]:99999/ 3 Accepted TCP client: [::ffff:172.1.1.1]:55160
2017-09-27 10:06:25.6690 3200 3 INFO listen://[::]:99999/ 3 Performing TLS server handshake
2017-09-27 10:06:47.2779 3200 3 INFO listen://[::]:99999/ 3 Unhandled error when handling request from client: [::ffff:172.1.1.1]:55160
System.IO.IOException: Unable to read data from the transport connection: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond. —> System.Net.Sockets.SocketException: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond
at System.Net.Sockets.NetworkStream.EndRead(IAsyncResult asyncResult)
— End of inner exception stack trace —
at System.Net.Security.SslState.EndProcessAuthentication(IAsyncResult result)
at System.Threading.Tasks.TaskFactory1.FromAsyncCoreLogic(IAsyncResult iar, Func2 endFunction, Action1 endAction, Task1 promise, Boolean requiresSynchronization)
— End of stack trace from previous location where exception was thrown —
at System.Runtime.CompilerServices.TaskAwaiter.ThrowForNonSuccess(Task task)
at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
at Halibut.Transport.SecureListener.d__15.MoveNext()

(see attached log for more details/Note: port and IP address details have been changed).

We proceeded to follow the same procedure we did earlier (restart the server service) and everything came back online again.

Noting that the servers in our development environment are older machines and hoping that was maybe the culprit we proceeded with running the tentacle updates in our test environment (which is all shiny new Windows 2016 servers) and one of those hung the same way as well.

This appears to be a network communication issue, but once again I am curious if anyone has seen anything like this before and what they may have found to fix/work around the issue.

Thanks!

TentacleLog.log (4 KB)

Hi Larry,

Thanks for getting in touch.

This issue has been giving us a lot of trouble lately as we haven’t been able to get to the bottom of it: https://github.com/OctopusDeploy/Issues/issues/3592

Usually it happens intermittently, I am surprised it has happened for all of your Tentacles. Can I ask what operating system your Server and Tentacles are running on?

From our investigations it looks like the server is hanging when trying to establish a connection to the Tentacle.

Cheers,
Shane

Shane,

The servers in our development environment are Windows 2008 R2. The servers in our test environment are Windows 2016. To be clear, not all servers hung. When upgrading it began by processing the first 4 of the 9 servers in our development environment. All 4 of those hung, once we got past those 4 on 2 of the last 5 hung. Only 1 of the 3 in our test environment hung during deployment.

Do you have any recommendations for a work around at this point (other than the process we have been using)?

Thanks for your assistance.
Larry House | Software Architect
Identifix | a Solera company
Find and Fix Faster
2714 Patton Road | Roseville, MN 55113
identifix.comhttp://www.identifix.com/
Facebookhttps://www.facebook.com/identifixit/ | Twitterhttps://twitter.com/identifixit | YouTubehttp://www.youtube.com/identifixtv | Linked inhttps://www.linkedin.com/company/2385376


FYI: If these is any info I can provide that might help in tracking down the root cause please let me know, I’d be happy to assist if I can.

Thanks!

Larry House

C | 952.999.6978

Software Architect

LI | www.linkedin.com/in/larry-house

2714 Patton Road | Roseville, MN 55113

W | www.identifix.comhttp://www.identifix.com/

[line image 1]

[cid:image004.jpg@01D32273.660392D0]

From: House, Larry [Identifix]
Sent: Thursday, September 28, 2017 9:28 PM
To: Shane Gill tender2+d3908c7c54@tenderapp.com
Subject: Re: Tentacle update seems to hang in server ui during update after server upgrade from 3.3.19 to v3.17.2 [Problems #59514]

Shane,

The servers in our development environment are Windows 2008 R2. The servers in our test environment are Windows 2016. To be clear, not all servers hung. When upgrading it began by processing the first 4 of the 9 servers in our development environment. All 4 of those hung, once we got past those 4 on 2 of the last 5 hung. Only 1 of the 3 in our test environment hung during deployment.

Do you have any recommendations for a work around at this point (other than the process we have been using)?

Thanks for your assistance.
Larry House | Software Architect
Identifix | a Solera company
Find and Fix Faster
2714 Patton Road | Roseville, MN 55113
identifix.comhttp://www.identifix.com/
Facebookhttps://www.facebook.com/identifixit/ | Twitterhttps://twitter.com/identifixit | YouTubehttp://www.youtube.com/identifixtv | Linked inhttps://www.linkedin.com/company/2385376


image003.jpg

image004.jpg