This repository has been archived by the owner on Nov 15, 2023. It is now read-only.
We are running our parachain testnet on ChaChaCha. Once every few days, a large portion of the network's nodes stall with a "Storage root must match that calculated" error while importing the same block, while one collator (collator1) keeps producing blocks and extending the parachain.
This probably caused a network split. On telemetry, the only running collator kept producing blocks with a txpool size of 0, while all the other nodes had thousands of pending extrinsics.
All the nodes besides collator1 accepted block 63240 but rejected block 63241 with the error below:
I've checked block 63241, and everything looks normal (polkadotjs). Its only extrinsic, sync_offchain_message, reports a message to our pallet. It doesn't do much besides checking the signature and incrementing a storage map value by one (message id). More than 300 similar messages had been executed successfully before that block, and we cannot see any difference between this one and the others.
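To make the symptom concrete, here is a toy sketch of the import check, assuming a much simplified model: the state is a plain map, the "storage root" is a hash over its entries (a real node computes a Merkle trie root instead), and `apply_message` is a hypothetical stand-in for our extrinsic's counter bump. None of these names come from Substrate or from our pallet. It only illustrates that a node whose local state has diverged from the author's will reject an otherwise ordinary block.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::BTreeMap;
use std::hash::{Hash, Hasher};

/// Toy state: one storage map with u64 keys and values (hypothetical layout).
type State = BTreeMap<u64, u64>;

/// Toy "storage root": a hash over all entries in key order.
/// A real node computes a Merkle trie root; this only stands in for it.
fn storage_root(state: &State) -> u64 {
    let mut hasher = DefaultHasher::new();
    for (key, value) in state {
        key.hash(&mut hasher);
        value.hash(&mut hasher);
    }
    hasher.finish()
}

/// Hypothetical stand-in for the extrinsic: increment one map value by one.
fn apply_message(state: &mut State, key: u64) {
    *state.entry(key).or_insert(0) += 1;
}

/// Import check: re-execute the block on the local state and compare the
/// resulting root with the one declared by the block author.
fn import_block(local: &mut State, key: u64, declared_root: u64) -> Result<(), String> {
    let mut next = local.clone();
    apply_message(&mut next, key);
    if storage_root(&next) != declared_root {
        return Err("Storage root must match that calculated".to_string());
    }
    *local = next;
    Ok(())
}

fn main() {
    // The author and one importer share the same pre-state; another has diverged.
    let mut author = State::new();
    author.insert(7, 3);
    let mut in_sync = author.clone();
    let mut diverged = author.clone();
    diverged.insert(7, 4);

    // The author executes the block and declares the resulting root.
    apply_message(&mut author, 7);
    let declared = storage_root(&author);

    println!("in-sync node:  {:?}", import_block(&mut in_sync, 7, declared));
    println!("diverged node: {:?}", import_block(&mut diverged, 7, declared));
}
```

In this model the extrinsic itself is never at fault: the rejection comes from the importer's pre-state (or its execution) differing from the author's, which would be consistent with block 63241 looking normal when inspected.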
Workaround: We deleted the parachain database on all the nodes except collator1, after which all the nodes synced correctly.
(This could be a different problem:) A few nodes were left untouched, and they are stuck with a different symptom. They got stuck earlier than 63240. I can see new blocks being imported, but best is always stuck at 63217:
On this node, Ctrl-C doesn't kill the process; I have to send SIGKILL. After a restart, the node shows a Reorg on #63217 and then syncs to the latest block.
Other information
The parachain runtime and the collator node are based on v0.9.8; polkadotjs
The relay chain is ChaChaCha, based on v0.9.7; polkadotjs
None of the nodes ever lost peers
ChaChaCha has always been producing blocks correctly
Dumped logs for the stuck collators (collator2/3): link (Search "Storage root must match that calculated" to locate the problem)
Related issue
#532