Reputation: 115
What happens when one of the multiple local SSDs attached to a compute engine instance has a hardware failure? Specifically:
Edit: I am aware of the "ephemeral" nature of local SSDs, and plan to replicate my data on multiple machines across different zones in my primary region, and at least one replication to a completely different region. The database I am planning to use is "data-center/rack aware". I am particularly looking for documentation/information about how Google Cloud Platform handles hardware failures in local SSDs.
Upvotes: 4
Views: 1543
Reputation: 4443
I'd like to clarify #5.
If your VM experiences host error google documentation states:
If the host system experiences a host error, Compute Engine makes a best effort to reconnect to the VM and preserve the local SSD data, but might not succeed. If the attempt is successful, the VM restarts automatically. However, if the attempt to reconnect fails, the VM restarts without the data.
Which means that you aren't guaranteed to get your data back. Which isn't fun plan accordingly and store your data in more reliable solutions such as persistent disks or buckets.
Upvotes: 1
Reputation: 9826
Points 4. and 5. are a bit tricky to answer - when an automatic restart for a hostError happens, you have a 60 minutes recovery timeout. This can however mean in practice that your instance is spending 60 minutes in a RUNNING but not booted state while trying to get a broken Local SSD back to then eventually fail and boot up with blank Local SSDs.
Overall, I would recommend you treat an instance as the failure domain and not individual disks as any sort of issue is likely to lead to a hostError of the instance instead of partial failure.
Upvotes: 2
Reputation: 76639
You might want to use persistent disks instead, because your use-case might not apply:
As adding local SSDs reads:
Local SSDs are suitable only for temporary storage such as caches, processing space, or low value data. If you store important data in a local SSD device, you must also store that same data in a durable storage option.
Upvotes: 3