Handle rate limiting error when fetching #32
Conversation
I notice you've implemented your own throttling handler. Will close this PR.
@evilmarty Yeah - that's my bad. I saw the email subject, didn't notice it was a PR, and went "oh crap, that's a regression", and went to fix it. Then I went looking for the issue, noticed it was a PR, but that you'd implemented it slightly differently. Your method would switch immediately to the next log_group (if multiple), which could then be throttled again, but would benefit from the set. My method backs off, but stays in the current group until complete, before moving on. However, it doesn't exit cleanly if asked to do so. I'm still not sure which is better. In general I'm open to opinions on this one.
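For illustration, the back-off-in-place idea might look roughly like this sketch (hypothetical helper name and parameters; not code from either PR):

```ruby
# Hypothetical sketch of the "back off but stay on the current group" idea
# described above. The exception class follows the plugin's conventions,
# but this is illustrative only.
def process_group_with_backoff(group, max_wait: 64)
  wait = 1
  begin
    process_group(group)
  rescue Aws::CloudWatchLogs::Errors::ThrottlingException
    sleep(wait)                       # blocks here, so it won't exit cleanly if asked to stop
    wait = [wait * 2, max_wait].min   # exponential back-off, capped
    retry
  end
end
```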
@lukewaite I've updated my PR with a slight change to my approach. Instead of handling the error at the point of call, I have moved the rescue up to the main loop. As you pointed out, we simply wait during the interval pause and try again, albeit with a different log group, to ensure none are neglected. What are your thoughts?
If CloudWatch raises a ThrottlingException it means there are too many requests. This error bubbles up and causes the plugin to crash in Logstash, which then restarts it. Instead we break for the `interval` duration and then start again from the log group where we left off. After a log group is read, we push its priority to the bottom to ensure every log group is inspected.
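A minimal sketch of that loop, assuming the plugin's `find_log_groups`/`process_group` helpers and the usual Logstash `Stud.stoppable_sleep`/`stop?` conventions (illustrative only, not the exact diff):

```ruby
# Illustrative sketch of the approach described above: rescue the throttling
# error around the whole pass, wait out the interval, then start again from
# where we left off. The priority rotation (pushing a finished group to the
# bottom) is assumed to happen inside find_log_groups/process_group.
def run(queue)
  @queue = queue
  until stop?
    begin
      groups = find_log_groups            # neglected groups come back first
      groups.each do |group|
        process_group(group)
      end
    rescue Aws::CloudWatchLogs::Errors::ThrottlingException
      @logger.debug("Reached rate limit, waiting until the next interval")
    end
    Stud.stoppable_sleep(@interval) { stop? }
  end
end
```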
@evilmarty Sorry for the delay in getting back to you on this one. First pass looks good, but I'd like to test locally before merging.
@lukewaite no worries. I've been using this code for over a week without hiccups in both my staging and production environments. Let me know how you go.
@evilmarty I've reverted 1256571 in prep for merging this, as it's not in your fork, and I don't think it's necessary with these changes.
```ruby
process_group(group)
end # groups.each
begin
groups = find_log_groups
```
I'm wondering if I should move this line out of your begin/rescue here and add back in the handling for `find_log_groups` that was part of 1256571.
I think that if we got throttled fetching log groups here, we might get into a situation where we're always being throttled, and just stopping.
It's probably a very rare corner case... but it might be better to handle it differently than how we handle an interruption while processing a group.
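One way to express that separation, sketched with a hypothetical retry wrapper (not the code from 1256571):

```ruby
# Hypothetical sketch of handling a throttled find_log_groups on its own,
# instead of letting it abort the whole pass over the groups.
def find_log_groups_with_retry(attempts: 3)
  attempts.times do
    begin
      return find_log_groups
    rescue Aws::CloudWatchLogs::Errors::ThrottlingException
      Stud.stoppable_sleep(@interval) { stop? }   # wait, then try the listing again
    end
  end
  []  # give up for this iteration rather than crashing the plugin
end
```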
`find_log_groups` hits the API as well, which can be throttled. If it's not handled, it'll simply error and cause the whole plugin to reload (potentially losing position, etc.). With this change, throttling is always handled for all API calls.
@lukewaite Anything further holding up this PR?
No blockers, @evilmarty, other than making the time for it, which I haven't. My apologies. I'll merge in now and tag a patch release.
Tagged now and published to rubygems.
If CloudWatch raises a ThrottlingException it means there are too many requests. This error bubbles up and causes the plugin to crash in Logstash, which then restarts it. Instead we simply return from `process_group` and try again later on, thus losing track since the last fetch (if not persisted).
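For contrast, the earlier approach amounts to roughly this sketch (hypothetical helper name; position persistence not shown):

```ruby
# Illustrative sketch of the earlier approach: swallow the throttling error
# inside process_group and return, so the group is retried on a later pass.
# fetch_events is a hypothetical stand-in for the actual API-calling code.
def process_group(group)
  fetch_events(group)
rescue Aws::CloudWatchLogs::Errors::ThrottlingException
  @logger.debug("Throttled while reading #{group}, will retry next interval")
  nil   # without a persisted position, progress since the last fetch is lost
end
```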