Write issues to data placement groups of type SSD in eu-se-1a (was: Slow storage due to traffic event in eu-se-1a)

Status: Fixed - on 11/2/22
Description:

Post mortem at 2022-11-02 09:30 AM (GMT+1, Europe/Stockholm timezone) an event caused a lot of storage traffic to occur in our eu-se-1a availability zone which affected availability in the form of slow storage.


Update (2022-11-03):

We’ve traced the root cause of this incident to an issue relating to a routine maintenance event where a small amount of the distributed data placement groups of type SSD could not be written.

This was due to our storage software having durability concerns because replica writes could not be guaranteed. The distributed data placement groups that could not be written in turn halts the I/O path for writes to that group, giving the appearance of resources not being available.

This started at 09:13:38, recovery was initiated at 09:25:34 and full service was restored at 09:28:22 (GMT+1 Europe/Stockholm)

We are very sorry for any inconvenience this might have caused our customers, we consider availability of our services to be paramount and will evaluate our routines and policies relating to the maintenance event in question.

Please note that the previous mentioned reason was a side-effect of the maintenance event and not the root-cause (or cause) for this incident.

Update on 11/2/22
Status changed from Identified Fixed

New information:

We have identified the issue and will work on improving the handling of such events in the future.

Update on 11/3/22
Status changed from Fixed Fixed

New information:

After further root cause analysis (RCA) we've updated the description and naming on this incident.

Contact

Phone: +46 020-11 33 80
Email: helpdesk@binero.com
Web: https://www.binero.com

Binero Group AB

Gustavslundsvägen 151 G
167 51 Bromma, Sweden
Org. number: 556264-3022

logo
Copyright © 2025 Binero Group AB