chore(test): easy way to get cluster stuck #741

Open · wants to merge 2 commits into base: v4

Conversation

@AVVS (Contributor) commented Oct 26, 2018

No description provided.

@AVVS (Contributor, Author) commented Oct 27, 2018

Pushed a possible solution to the problem.

The problem I'm trying to solve:

The resolve callback is lost when we issue .disconnect(true), so the promise returned by .connect() never settles.
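
For illustration, a minimal reproduction of the symptom might look like the following (the node address, port, and the lazyConnect usage are assumptions, not part of the PR's test):

```js
const Redis = require("ioredis");

// Assumed setup: one cluster node on 127.0.0.1:30001, lazyConnect so that
// we control exactly when connect() is called.
const cluster = new Redis.Cluster([{ host: "127.0.0.1", port: 30001 }], {
  lazyConnect: true,
});

// The promise returned by connect() should settle one way or the other...
cluster
  .connect()
  .then(() => console.log("connected"))
  .catch((err) => console.error("connect() rejected:", err));

// ...but a disconnect that immediately schedules a reconnect can, depending
// on timing, leave that promise neither resolved nor rejected.
cluster.disconnect(true);
```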

Workaround

Start tracking all .connect() promises via a private .connectionPromise variable, which exposes resolve and reject for the next connect promise and self-erases upon resolution.
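
A minimal sketch of that tracking idea (names are illustrative, not the exact ones used in the PR):

```js
// Illustrative sketch of the .connectionPromise tracking, not the PR's code.
class ClusterWithConnectionPromise {
  connect() {
    // Reuse the pending entry so every caller of connect() settles together.
    if (this.connectionPromise) {
      return this.connectionPromise.promise;
    }

    const entry = {};
    entry.promise = new Promise((resolve, reject) => {
      entry.resolve = resolve;
      entry.reject = reject;
    });
    this.connectionPromise = entry;

    // Self-erase once settled so the cross-references can be collected.
    const cleanup = () => {
      if (this.connectionPromise === entry) {
        this.connectionPromise = null;
      }
    };
    entry.promise.then(cleanup, cleanup);

    // The real implementation would start the connection attempt here and
    // call entry.resolve() on "ready" or entry.reject(err) on a fatal close.
    return entry.promise;
  }
}
```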

Things to think about

  • Garbage collection: I do seem to be cleaning up everything, but it still introduces tons of cross-references.
  • Instead, listen to events (?), i.e. handle ready/close inside the ready handler alongside .disconnect(true).
  • Simply reject() the .connect() promise: that requires a breaking change to the current behaviour, as one would have to implement retrying outside of ioredis.
  • Reject and implement retrying (via the bluebird-retry library, for instance); a sketch of external retrying follows this list.
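
A rough sketch of that last option, assuming connect() were changed to reject on a failed attempt (a plain loop here; bluebird-retry wraps the same idea behind a nicer API):

```js
// Illustrative only: retry a rejecting connect() outside of ioredis.
async function connectWithRetry(cluster, maxAttempts = 5, delayMs = 1000) {
  for (let attempt = 1; attempt <= maxAttempts; attempt += 1) {
    try {
      await cluster.connect();
      return;
    } catch (err) {
      if (attempt === maxAttempts) {
        throw err;
      }
      // Simple linear backoff between attempts.
      await new Promise((resolve) => setTimeout(resolve, delayMs * attempt));
    }
  }
}
```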

@AVVS AVVS requested a review from luin October 27, 2018 00:19
@AVVS (Contributor, Author) commented Oct 31, 2018

@luin any thoughts on that?

@AVVS AVVS requested a review from shaharmor November 6, 2018 11:11
@AVVS (Contributor, Author) commented Nov 7, 2018

@shaharmor the issue is described here: #709

stale bot commented Dec 7, 2018

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 7 days if no further activity occurs, but feel free to re-open a closed issue if needed.

@stale stale bot added the wontfix label Dec 7, 2018
@luin luin added pinned and removed wontfix labels Dec 8, 2018
@AVVS (Contributor, Author) commented Dec 11, 2018

@luin what do you think of the approach? Should we just redo this with event-based handling instead of chaining promises?

@luin (Collaborator) commented Dec 12, 2018

> @luin what do you think of the approach? Should we just redo this with event-based handling instead of chaining promises?

Sorry for the late response! Been pretty busy lately 😢. This approach works but adds a little complexity. I would consider Cluster#connect() failed when the current connection attempt is not successful (although subsequent attempts may succeed). This keeps the same behavior as Redis#connect(). People need to rely on events like "ready"/"connect"/"close"/"end" if they need to know the connection status at a higher level (which will be most cases).
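
For example, higher-level status tracking via events could look like this (a sketch; `cluster` stands for an existing Cluster instance):

```js
// Track high-level connection status via events, not the connect() promise.
cluster.on("connect", () => console.log("connection established"));
cluster.on("ready", () => console.log("cluster is ready for commands"));
cluster.on("close", () => console.log("connection closed"));
cluster.on("end", () => console.log("no further reconnection attempts"));
cluster.on("error", (err) => console.error("cluster error:", err));
```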

Currently, the problem seems to be that we remove the close handler too early, so reject() won't be called if a disconnection happens between the "refresh" event and the "ready" event. To solve that, we may move the removeListener('close') call to the ready event callback, so that Cluster#connect() will be rejected in your test case (although the cluster will be ready eventually). What do you think?
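
A simplified sketch of that suggestion (not the actual ioredis source):

```js
const { EventEmitter } = require("events");

// Keep the "close" listener attached until "ready" fires, so a disconnection
// between the slot-refresh step and "ready" still rejects connect().
class ClusterConnectSketch extends EventEmitter {
  connect() {
    return new Promise((resolve, reject) => {
      const closeListener = () => {
        this.removeListener("ready", readyListener);
        reject(new Error("connection closed before the cluster became ready"));
      };
      const readyListener = () => {
        // Detach the close handler only once the cluster is actually ready.
        this.removeListener("close", closeListener);
        resolve();
      };
      this.once("ready", readyListener);
      this.once("close", closeListener);
      // The real implementation would start connecting and refreshing the
      // slots cache here.
    });
  }
}
```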

@AVVS (Contributor, Author) commented Dec 19, 2018

> Currently, the problem seems to be that we remove the close handler too early, so reject() won't be called if a disconnection happens between the "refresh" event and the "ready" event. To solve that, we may move the removeListener('close') call to the ready event callback, so that Cluster#connect() will be rejected in your test case (although the cluster will be ready eventually). What do you think?

I think rejecting makes sense, and then users can decide how to handle it. As long as the promise is not stuck forever, we are already way ahead of where we are now.

At the same time, I think it would be great to add a convenience method that does the auto-reconnect sort of thing by listening to the ready/close/error events, possibly with some sort of auto-retry strategy that has a finite end. Something along the lines of the sketch below.
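
Purely illustrative (no such helper exists in ioredis today; the name and signature are made up):

```js
// Wait until the cluster is ready, giving up after a finite number of
// "close" events; ioredis' own reconnect logic does the retrying in between.
function waitUntilReady(cluster, maxAttempts = 5) {
  return new Promise((resolve, reject) => {
    let attempts = 0;

    const onReady = () => {
      cleanup();
      resolve();
    };
    const onClose = () => {
      attempts += 1;
      if (attempts >= maxAttempts) {
        cleanup();
        reject(new Error(`cluster not ready after ${attempts} attempts`));
      }
    };
    const onError = () => {
      // Swallow errors here; a "close" event follows a failed attempt.
    };
    const cleanup = () => {
      cluster.removeListener("ready", onReady);
      cluster.removeListener("close", onClose);
      cluster.removeListener("error", onError);
    };

    cluster.on("ready", onReady);
    cluster.on("close", onClose);
    cluster.on("error", onError);
  });
}
```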

@AVVS (Contributor, Author) commented May 9, 2019

@luin do you think you'd have time to do something about this? :)

@luin (Collaborator) commented May 15, 2019

@AVVS I don't have time to dig into it at the moment, but I can likely find some time for it within the next two months. I think the issue still exists, doesn't it?

@AVVS (Contributor, Author) commented Nov 24, 2019

@luin the issue still exists, indeed :) Do you think you might have time to review the code, or to come up with a better solution than the one I've coded?
