if one safekeeper crashes without a way to recover, humans are still needed?

If the safekeeper is up again, it can join the cluster automatically. But if it crashes without a way to recover, we need to change cluster membership. Right now, such a change requires humans to be in the loop to ensure that the old safekeeper is actually down. It is on our roadmap to automate this procedure.

was it done? Thanks.

4 Replies

complex-teal•8mo ago

Automatic handling of Safekeeper failure is currently being worked on, but losing a Safekeeper has no impact on serice availability or performance. Writes continue to the other Safekeepers.

other-emeraldOP•8mo ago

but we have to add one new Safekeeper back to keep safety, is that automated?

complex-teal•8mo ago

Adding a new Safekeeper back is not automated yet, but it's being worked on now to remove the manual handling process.

other-emeraldOP•8mo ago

Gaming

Programming

if one safekeeper crashes without a way to recover, humans are still needed?

Did you find this page helpful?