Resolved -
This incident has been resolved.
May 9, 01:30 UTC
Monitoring -
Rebalancing our ingestion servers has stabilized the incoming data volume. Engineering is adding capacity to accelerate the processing of the data backlog. The impacted cluster has resumed ingestion at normal levels. All affected data has been queued since the beginning of the incident, so there is no data loss.
May 9, 00:07 UTC
Identified -
We have identified an issue with a backend cluster that is currently disrupting event ingestion. Our engineering team is performing a system failover to restore normal operations.
May 8, 22:50 UTC
Investigating -
We are currently investigating this issue.
May 8, 22:31 UTC