
backend: do not copy buffer when creating read tx #12529

Closed

Conversation

Contributor

@mlmhl mlmhl commented Dec 8, 2020

Currently, the backend buffer is copied once for each read request, which may bring significant additional overhead. For example, in a Kubernetes cluster all write requests are Txn requests, and each of them triggers a read request to check the Compare assertion. What's more, kube-apiserver watches etcd with the previous KV required, so each watch event also triggers a read request. In a busy Kubernetes cluster there are many read and write requests at the same time, resulting in a large buffer and a large number of buffer copy operations.

However, since the buffer is managed as a sorted array, the cost of a single read operation is lower than that of copying the entire buffer, so we can remove the buffer copy and simply hold the read lock while invoking the buffer's range operation.
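To make the idea concrete, here is a minimal sketch (not etcd's actual implementation; `readTx`, `bufferEntry`, and the field names are illustrative) of a read transaction that shares the sorted buffer with writers and takes the read lock only for the duration of a range call:

```go
package backend

import "sync"

// bufferEntry is an illustrative key/value pair kept in the sorted read buffer.
type bufferEntry struct {
	key, val []byte
}

// readTx shares the buffer with the write path instead of copying it when the
// read transaction is created.
type readTx struct {
	mu  *sync.RWMutex // shared with the writer that flushes into buf
	buf []bufferEntry // sorted by key; shared, not copied per read tx
}

// UnsafeRange returns up to limit entries in [key, end). The read lock is held
// only while ranging over the sorted buffer, which is cheaper than copying the
// whole buffer when the read transaction is created.
func (t *readTx) UnsafeRange(key, end []byte, limit int64) (keys, vals [][]byte) {
	t.mu.RLock()
	defer t.mu.RUnlock()

	for _, e := range t.buf {
		if string(e.key) < string(key) {
			continue
		}
		if len(end) > 0 && string(e.key) >= string(end) {
			break
		}
		keys = append(keys, e.key)
		vals = append(vals, e.val)
		if limit > 0 && int64(len(keys)) >= limit {
			break
		}
	}
	return keys, vals
}
```

In the real backend the results would still be merged with the underlying BoltDB read transaction; the sketch only shows where the locking moves.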

I developed a simple test tool that generates concurrent read and write requests at the same time, and used it to test the versions before and after this optimization (a rough sketch of such a tool follows the table below). Below is a preliminary test result: the key size is 64 bytes, the concurrency of both read and write operations is 500, and read and write requests are executed 100,000 and 300,000 times respectively. It seems that this optimization can significantly improve the performance of read operations.

| value size (bytes) | read qps | optimized read qps | write qps | optimized write qps |
|---:|---:|---:|---:|---:|
| 128 | 11313 | 24093 | 6276 | 13118 |
| 256 | 11538 | 24229 | 6405 | 12997 |
| 512 | 11669 | 23000 | 6447 | 12062 |
| 1024 | 10999 | 19594 | 6285 | 10036 |
| 2048 | 10354 | 13780 | 5595 | 6157 |
| 3072 | 10202 | 12112 | 5197 | 5297 |
| 4096 | 8217 | 9351 | 3773 | 3794 |
| 5120 | 7507 | 8564 | 3448 | 3462 |
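For reference, below is a rough sketch of the kind of load tool described above; it is not the actual tool used for the numbers in the table, and the endpoint, client import path, key layout, and per-worker counts are illustrative assumptions:

```go
package main

import (
	"bytes"
	"context"
	"fmt"
	"sync"
	"time"

	clientv3 "go.etcd.io/etcd/client/v3" // assumed module path; older releases use go.etcd.io/etcd/clientv3
)

func main() {
	cli, err := clientv3.New(clientv3.Config{
		Endpoints:   []string{"127.0.0.1:2379"},
		DialTimeout: 5 * time.Second,
	})
	if err != nil {
		panic(err)
	}
	defer cli.Close()

	const concurrency = 500     // concurrent readers and concurrent writers
	const readsPerWorker = 200  // 500 * 200 = 100,000 reads in total
	const writesPerWorker = 600 // 500 * 600 = 300,000 writes in total
	value := string(bytes.Repeat([]byte("v"), 1024)) // value size under test

	start := time.Now()
	var wg sync.WaitGroup
	for i := 0; i < concurrency; i++ {
		key := fmt.Sprintf("/bench/%057d", i) // 64-byte key
		wg.Add(2)
		go func(k string) { // writer
			defer wg.Done()
			for j := 0; j < writesPerWorker; j++ {
				if _, err := cli.Put(context.Background(), k, value); err != nil {
					panic(err)
				}
			}
		}(key)
		go func(k string) { // reader, running concurrently with the writers
			defer wg.Done()
			for j := 0; j < readsPerWorker; j++ {
				if _, err := cli.Get(context.Background(), k); err != nil {
					panic(err)
				}
			}
		}(key)
	}
	wg.Wait()
	fmt.Printf("finished in %v\n", time.Since(start))
}
```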

Contributor

tangcong commented Dec 8, 2020

@mlmhl If etcd holds the read lock, it will cause the "large read blocking write" issue (#10525) in mvcc/backend. The buffer copy was introduced by the PR that solved that issue.

```diff
@@ -704,8 +704,6 @@ func clusterVersionFromBackend(lg *zap.Logger, be backend.Backend) *semver.Versi
 func downgradeInfoFromBackend(lg *zap.Logger, be backend.Backend) *DowngradeInfo {
 	dkey := backendDowngradeKey()
 	tx := be.ReadTx()
 	tx.Lock()
```
Contributor

@ptabor ptabor Jan 31, 2021


Could you please comment on why this transaction does not need to take the lock, and why it's worth not taking it?

The method seems to be called very infrequently (during recovery).
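For context, here is a sketch of the locked read pattern this review thread refers to, reconstructed from the hunk context above; the `clusterBucketName` bucket and the unmarshalling details are assumptions based on the surrounding membership code, not a quote of the actual file:

```go
func downgradeInfoFromBackend(lg *zap.Logger, be backend.Backend) *DowngradeInfo {
	dkey := backendDowngradeKey()
	tx := be.ReadTx()
	tx.Lock() // presumably one of the lines this hunk removes; it serializes access to the shared read buffer
	defer tx.Unlock()

	_, vals := tx.UnsafeRange(clusterBucketName, dkey, nil, 0)
	if len(vals) == 0 {
		return nil
	}
	var d DowngradeInfo
	if err := json.Unmarshal(vals[0], &d); err != nil {
		lg.Panic("failed to unmarshal downgrade information", zap.Error(err))
	}
	return &d
}
```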

Contributor

ptabor commented Jan 31, 2021

Very promising improvement, thank you. Could you please re-trigger the test?

@jingyih could you please take a second look? It LGTM, but you have worked a lot in this area.

Contributor

ptabor commented May 14, 2021

I assume it's obsoleted by #12933.

Thank you for the idea.

@ptabor ptabor closed this May 14, 2021