Intermittent Downtime in Rails Application Running on AWS App Runner #261

svyatmuzyka · 2024-09-17T20:22:09Z

Description:
I'm experiencing intermittent downtime with my Rails application hosted on AWS App Runner. A third-party service, Crisp, monitors the availability of my site and sends notifications whenever it is unreachable. I receive these downtime alerts from Crisp a couple of times a day.

Steps to reproduce:

Deploy a Rails application on AWS App Runner.
Set up a monitoring service (e.g., Crisp) to check site availability.
Observe the intermittent downtime notifications.

Expected behavior:
The application should remain available consistently, especially since there are no signs of high CPU or memory usage.

Observed behavior:
Even though the application is running under low load (as indicated by AWS App Runner metrics), Crisp reports that my site is unavailable for short periods. During these periods:

Metrics show no CPU or memory exhaustion.
However, AWS CloudWatch logs indicate gaps of up to 4 minutes between the last stream's end and the beginning of the next one, which aligns with the reported downtime.

Questions:

What could be causing these 4-minute gaps in the logs, which coincide with the periods when my site is unreachable?
Why might the Rails application be stopping so frequently, even when resource usage seems normal?

Logs and metrics:
I have attached relevant logs above and metric graphs below. Let me know if you need more details.

EliteXCoder1 · 2024-11-27T20:34:45Z

I think i am having a similar issue with my django/python app. Everything was working fine until i did a recent update 3 days ago and now at least like once or twice every hour but only sometimes and a new instance gets deployed when this was never happening before. My app will be down for a couple of minutes until it's available again... My metrics do not show any spike in CPU or Memory before it happens and I know this is correct cause my app doesn't have much traffic at this point. Cloud watch generates new instance logs every time this happens so i know when a new instance gets created. I have the dev version in the same environment, just a difference service and its not happening to that service. Whenever this issue happens, i get the following in the browser until the new instance is up:

upstream connect error or disconnect/reset before headers. reset reason: connection failure, transport failure reason: delayed connect error: 111

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Intermittent Downtime in Rails Application Running on AWS App Runner #261

Intermittent Downtime in Rails Application Running on AWS App Runner #261

svyatmuzyka commented Sep 17, 2024

EliteXCoder1 commented Nov 27, 2024

Intermittent Downtime in Rails Application Running on AWS App Runner #261

Intermittent Downtime in Rails Application Running on AWS App Runner #261

Comments

svyatmuzyka commented Sep 17, 2024

EliteXCoder1 commented Nov 27, 2024