IntelliHub Outage

Incident Report for Forward Thinking Systems

Postmortem

In this case, there were 2 separate issues.

  1. A bug was found in our edge routers which caused excessive RAM usage, putting the router into “Conserve Mode” which started to make Intellihub and Engage360 inaccessible for most users. After rebooting, traffic resumed normally. This caused major issues for users for about 18 minutes.
  2. A second bug was exposed (and exacerbated by the first issue) in camera devices not reporting or streaming correctly. This was due to how our servers receiving the data handle their connection to the backend database. The end result was that servers were having trouble maintaining connection to devices, which delayed reporting and made livestreaming innaccessible for a large amount of units. This caused issues on and off between 12:30PM - 6:00PM EST before root cause was found and fixed.

A full incident report is available upon request.

Posted Aug 13, 2024 - 13:25 EDT

Resolved

We are Investigating reports of Intellihub and Engage being down
Posted Aug 08, 2024 - 11:30 EDT