Do not mutate RecoveryResponse #37204

dnhatn · 2019-01-07T19:59:08Z

Today we create a global instance of RecoveryResponse then mutate it when executing each recovery step. This is okay for the current sequential recovery flow. However, this is not suitable for an asynchronous recovery which we are targeting. With this commit, we return the result of each step separately and construct a RecoveryResponse at the end.

Relates #37174

elasticmachine · 2019-01-07T19:59:09Z

Pinging @elastic/es-distributed

server/src/main/java/org/elasticsearch/indices/recovery/RecoveryResponse.java

ywelsch · 2019-01-08T13:43:28Z

server/src/main/java/org/elasticsearch/indices/recovery/RecoveryResponse.java

        out.writeVLong(phase1TotalSize);
        out.writeVLong(phase1ExistingTotalSize);
        out.writeVLong(phase1Time);
-        out.writeVLong(phase1ThrottlingWaitTime);
+        if (out.getVersion().before(Version.V_7_0_0)) {
+            out.writeVLong(0L); // phase1ThrottlingWaitTime - not used


it's not used here because it's wrongly implemented (we add the throttle time to the source shard instead of the target shard, see createRecoverySourceHandler in PeerRecoverySourceService. Instead I think it should be send back to the target shard and added there at the end of recovery.

Let's keep the phase1ThrottlingWaitTime here for now, and follow-up with a fix for the actual stats.

Hmm, we send the source throttle time each FileChunk request to the target

elasticsearch/server/src/main/java/org/elasticsearch/indices/recovery/PeerRecoveryTargetService.java

Line 610 in 1ca6666

indexState.addSourceThrottling(request.sourceThrottleTimeInNanos());

Should phase1ThrottlingWaitTime be the total throttle time on both source and target? Note that we currently use RecoveryReponse only for logging purpose. Anyway, I added phase1ThrottlingWaitTime back.

Should phase1ThrottlingWaitTime be the total throttle time on both source and target?

yes, we can transfer the knowledge of the source throttle time as part of the response object back to the target, and then add it to the target throttle time there. An alternative would be to transfer info about source throttle time with each file chunk that we send.

dnhatn · 2019-01-08T16:15:50Z

@ywelsch Thanks for looking. I have addressed your comments.

ywelsch

s/mutable/mutate/ in PR title :-)

dnhatn · 2019-01-08T19:23:36Z

@elasticmachine run gradle build tests 2

dnhatn · 2019-01-08T21:10:51Z

@ywelsch thanks for reviewing.

Today we create a global instance of RecoveryResponse then mutate it when executing each recovery step. This is okay for the current sequential recovery flow but not suitable for an asynchronous recovery which we are targeting. With this commit, we return the result of each step separately, then construct a RecoveryResponse at the end. Relates #37174

Backport of elastic/elasticsearch#37204

Do not mutable RecoveryResponse

1ca6666

dnhatn added >enhancement :Distributed Indexing/Recovery Anything around constructing a new shard, either from a local or a remote source. v7.0.0 v6.7.0 labels Jan 7, 2019

dnhatn requested review from s1monw, ywelsch and DaveCTurner January 7, 2019 19:59

ywelsch suggested changes Jan 8, 2019

View reviewed changes

dnhatn added 2 commits January 8, 2019 10:44

feedback

be20f2f

Merge branch 'master' into recovery-response

913c786

dnhatn requested a review from ywelsch January 8, 2019 16:15

ywelsch approved these changes Jan 8, 2019

View reviewed changes

dnhatn changed the title ~~Do not mutable RecoveryResponse~~ Do not mutate RecoveryResponse Jan 8, 2019

dnhatn merged commit 87ac310 into elastic:master Jan 8, 2019

dnhatn deleted the recovery-response branch January 8, 2019 21:12

dnhatn added the backport pending label Jan 8, 2019

dnhatn removed the backport pending label Jan 12, 2019

colings86 added v7.0.0-beta1 and removed v7.0.0 labels Feb 7, 2019

kovrus added a commit to crate/crate that referenced this pull request Sep 11, 2019

Do not mutate RecoveryResponse.

9c03d9e

Backport of elastic/elasticsearch#37204

kovrus added a commit to crate/crate that referenced this pull request Sep 11, 2019

Do not mutate RecoveryResponse.

4f93e02

Backport of elastic/elasticsearch#37204

kovrus added a commit to crate/crate that referenced this pull request Sep 12, 2019

Do not mutate RecoveryResponse.

1676811

Backport of elastic/elasticsearch#37204

kovrus added a commit to crate/crate that referenced this pull request Sep 12, 2019

Do not mutate RecoveryResponse.

a3f58e2

Backport of elastic/elasticsearch#37204

mergify bot pushed a commit to crate/crate that referenced this pull request Sep 12, 2019

Do not mutate RecoveryResponse.

257e060

Backport of elastic/elasticsearch#37204

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not mutate RecoveryResponse #37204

Do not mutate RecoveryResponse #37204

dnhatn commented Jan 7, 2019

elasticmachine commented Jan 7, 2019

ywelsch Jan 8, 2019

dnhatn Jan 8, 2019

ywelsch Jan 8, 2019

dnhatn commented Jan 8, 2019

ywelsch left a comment

dnhatn commented Jan 8, 2019

dnhatn commented Jan 8, 2019

Do not mutate RecoveryResponse #37204

Do not mutate RecoveryResponse #37204

Conversation

dnhatn commented Jan 7, 2019

elasticmachine commented Jan 7, 2019

ywelsch Jan 8, 2019

Choose a reason for hiding this comment

dnhatn Jan 8, 2019

Choose a reason for hiding this comment

ywelsch Jan 8, 2019

Choose a reason for hiding this comment

dnhatn commented Jan 8, 2019

ywelsch left a comment

Choose a reason for hiding this comment

dnhatn commented Jan 8, 2019

dnhatn commented Jan 8, 2019