On March 19th, our systems experienced three separate outages, each lasting approximately an hour. Our investigation determined that all three had the same cause: an unexpected spike in authentication traffic. The surge left our backend servers unresponsive to login requests from both the web and mobile applications.

To address the immediate cause of the outages, we fully rebooted all system components. This restored functionality, but it may have caused brief additional outages for customers in the app, as well as anomalies such as failed save requests while the reboot was in progress.

Yesterday, our team investigated the root cause and implemented several solutions to prevent similar occurrences in the future:
Together, these proactive measures and infrastructure improvements significantly reduce the likelihood of similar incidents affecting our customers. We are confident that our system is now better equipped to handle unexpected spikes in traffic and maintain uninterrupted service.