At 08:57 UTC (12:57 PST) on 2021-11-25, we observed elevated latencies and errors for the Storage service in the US-East PoP. The elevated latency caused the inability to retrieve messages from storage within the expected timeframe.
This issue occurred because our storage vendor was running deployments in the US-East PoP region which resulted in higher storage write latencies. We contacted our storage vendor to postpone the deployments to later in the day when traffic levels were lower. The issue was resolved at 14:08 UTC.
Mitigation Steps and Recommended Future Preventative Measures
To prevent a similar issue from occurring in the future we will coordinate optimal times for off-peak hours deployments.