Skip to content

Commit

Permalink
Document the reduction in fault detection timeouts (#40200)
Browse files Browse the repository at this point in the history
The new cluster coordination subsystem introduced in 7.0 will only keep an
unresponsive node in the cluster for 30 seconds, whereas in earlier versions it
might have remained in the cluster for 90 seconds. This commit adds a note to
the migration documentation to that effect.
  • Loading branch information
DaveCTurner authored Mar 20, 2019
1 parent 8f84f45 commit 5eb33f2
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions docs/reference/migration/migrate_7_0/discovery.asciidoc
Original file line number Diff line number Diff line change
Expand Up @@ -46,3 +46,13 @@ The `discovery.zen.no_master_block` setting is now known as
`cluster.no_master_block`. Any value set for `discovery.zen.no_master_block` is
now ignored. You should remove this setting and, if needed, set
`cluster.no_master_block` appropriately after the upgrade.

[float]
==== Reduced default timeouts for fault detection

By default the <<cluster-fault-detection,cluster fault detection>> subsystem
now considers a node to be faulty if it fails to respond to 3 consecutive
pings, each of which times out after 10 seconds. Thus a node that is
unresponsive for longer than 30 seconds is liable to be removed from the
cluster. Previously the default timeout for each ping was 30 seconds, so that
an unresponsive node might be kept in the cluster for over 90 seconds.

0 comments on commit 5eb33f2

Please sign in to comment.