On-premise Azure Service Fabric Services upgrade fails with no obvious error messages

Question

I am trying to upgrade a 3 node on-premise Service Fabric cluster. It is currently running 9.0.1553.9590 to 9.1.1883.9590 (previously sucesfully upgraded from 7.x > 8.x and onto 9.0)

I am able to trigger the upgrade as I have done on the previous ones and the first node appears to upgrade successfully, but then the healthchecks, before starting the 2nd upgrade domain VM seem, to fail and the upgrade is rolled back.

Noting that

The cluster appears otherwise healthy
I have tried doubling the upgrade timeout from 10 minutes to 20 minutes, I get the same result
The Windows Event Log shows no obvious errors, and all the various Service Fabric processes appears to load correctly, as you might expect given the first point

So does anyone have any suggestion how to debug this, or where to find any detailed upgrade logs?

On-premise Azure Service Fabric Services upgrade fails with no obvious error messages

Answers (1)

Related Questions