Engage360 Portal Issues

Incident Report for Forward Thinking Systems

Postmortem

One of our backend databases is hosted in the cloud. Today we lost the FastConnect link between our NY datacenter and that database's cloud location for about 30 minutes. About 10 minutes into the outage, we decided to fail over to the backup IPsec connection so that we could keep serving data to critical applications (Engage360 and the Titan apps). The failover took approximately another 10 minutes (that process is currently manual and will soon be automated).

This should not be an issue going forward. We are automating the failover for this site-to-site connection within the next 2 weeks. We are also migrating this database to our on-prem datacenter, which will eliminate this point of failure altogether.
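
For readers curious about what automated failover of this kind can look like, here is a minimal sketch. It is illustrative only and does not describe the production implementation: the class name, the probe and activation callbacks, and the threshold of three consecutive failed probes are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable

PRIMARY = "primary"  # hypothetical label for the dedicated link
BACKUP = "backup"    # hypothetical label for the backup IPsec tunnel


@dataclass
class FailoverMonitor:
    """Switches to the backup link after repeated failed health probes.

    All names and defaults here are assumptions for illustration.
    """
    probe: Callable[[], bool]            # returns True when the primary link is healthy
    activate_backup: Callable[[], None]  # performs the actual route switch
    failure_threshold: int = 3           # consecutive failures before failing over
    active: str = PRIMARY
    consecutive_failures: int = 0

    def tick(self) -> str:
        """Run one health check and return the link that is now active."""
        if self.active == BACKUP:
            # Failing back is left as a deliberate manual step in this sketch.
            return self.active
        if self.probe():
            # A single healthy probe resets the failure counter.
            self.consecutive_failures = 0
        else:
            self.consecutive_failures += 1
            if self.consecutive_failures >= self.failure_threshold:
                self.activate_backup()
                self.active = BACKUP
        return self.active
```

Requiring several consecutive failures before switching avoids failing over on a single dropped probe, while still reacting far faster than a manual decision.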

Posted May 03, 2023 - 12:30 EDT

Resolved

This incident has been resolved.
Posted May 03, 2023 - 12:24 EDT

Monitoring

A fix has been implemented, and we are monitoring to make sure the problem does not recur.
Posted May 03, 2023 - 10:48 EDT

Identified

An issue with backend servers has been identified and is being worked on. We will update within the next 30 minutes.

Affected services: Engage360 portal and Titan Installer app
Posted May 03, 2023 - 10:25 EDT
This incident affected: Portals (Engage360).