
Huge lists do not migrate fully between cluster nodes #4143

Closed
chakaz opened this issue Nov 18, 2024 · 2 comments · Fixed by #4154
Assignees: chakaz
Labels: bug (Something isn't working)

Comments

chakaz (Collaborator) commented Nov 18, 2024

(this may apply to other data types as well)

To reproduce:

  • Run 2 cluster-mode Dragonfly processes (one with --port=7001 and one with --port=7002, and pass --cluster_mode=yes to both)
  • Configure them to have all slots belong to the first:
./cluster_mgr.py --action=config_single_remote --target_port=7001
./cluster_mgr.py --action=attach --target_port=7001 --attach_port=7002
  • Debug populate a huge list on the node with the slots:
$ redis-cli -p 7001
localhost:7001> debug populate 1 l: 1000 RAND TYPE list ELEMENTS 500000
OK
localhost:7001> llen l::0
(integer) 500000
  • Migrate all slots to second node:
./cluster_mgr.py --action=migrate --slot_start=0 --slot_end=16383 --target_port=7002
  • Note the issue (a scripted check is sketched after these steps):
localhost:7002> llen l::0
(integer) 4096
localhost:7002> memory usage l::0
(integer) 86064
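
For convenience, here is a minimal sketch of that check in script form (assuming redis-py is installed and both nodes run on localhost, as in the steps above):

# Minimal verification sketch -- assumes redis-py and the setup above.
import redis

TARGET_PORT = 7002        # node the slots were migrated to
KEY = "l::0"              # key created by DEBUG POPULATE above
EXPECTED = 500_000        # ELEMENTS argument used above

target = redis.Redis(port=TARGET_PORT)
actual = target.llen(KEY)
print(f"LLEN {KEY} on :{TARGET_PORT} -> {actual}")
if actual != EXPECTED:
    # With this bug, only the first 4096 elements arrive.
    print(f"BUG: expected {EXPECTED} elements, got {actual}")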
@chakaz chakaz added the bug Something isn't working label Nov 18, 2024
@chakaz chakaz self-assigned this Nov 18, 2024
@adiholden adiholden added this to the dfly cluster v4 milestone Nov 18, 2024
anadion commented Nov 19, 2024

We have a similar problem with the RENAME command and big zset keys.

redis_version:6.2.11
dragonfly_version:df-v1.24.0
redis_mode:standalone
127.0.0.1:6379[14]> type ipv4
zset
127.0.0.1:6379[14]> ZCARD ipv6_tmp
(integer) 7391471
127.0.0.1:6379[14]> RENAME ipv6_tmp ipv6
OK
(6.75s)
127.0.0.1:6379[14]> ZCARD ipv6
(integer) 4092
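
For reference, a minimal sketch (assuming redis-py; the key names and counts are taken from the transcript above) of the same check in script form:

import redis

r = redis.Redis(port=6379, db=14)
before = r.zcard("ipv6_tmp")     # 7391471 in the transcript above
r.rename("ipv6_tmp", "ipv6")
after = r.zcard("ipv6")
print(before, after)             # with this bug: after is only 4092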

chakaz (Collaborator, Author) commented Nov 19, 2024

Yes @anadion, that is due to the same root cause. Thanks for reporting!

chakaz added a commit that referenced this issue Nov 19, 2024
We have an internal utility tool that we use to deserialize values in
some use cases:

* `RESTORE`
* Cluster slot migration
* `RENAME`, if the source and target shards are different

We [recently](#3760)
changed this area of the code, which caused this regression as it only
handled RDB / replication streams.

Fixes #4143
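To illustrate the failure mode described above (an illustrative sketch only, not Dragonfly's actual serialization code; the 4096-element segment size is an assumption inferred from the LLEN output in this issue):

SEGMENT = 4096  # hypothetical segment size, matching the truncated LLEN

def serialize(elements):
    # Huge values are emitted as a stream of partial segments.
    for i in range(0, len(elements), SEGMENT):
        yield elements[i:i + SEGMENT]

def load_buggy(segments):
    # Regression: treat the first segment as the whole value.
    return list(next(iter(segments)))

def load_fixed(segments):
    # Fix: keep appending segments until the stream is exhausted.
    out = []
    for seg in segments:
        out.extend(seg)
    return out

items = list(range(500_000))
assert len(load_buggy(serialize(items))) == 4_096    # what LLEN shows
assert len(load_fixed(serialize(items))) == 500_000  # expected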
@chakaz chakaz closed this as completed in 24a1ec6 Nov 20, 2024
adiholden pushed a commit that referenced this issue Nov 20, 2024
fix: Huge entries fail to load outside RDB / replication (same commit message as above)