-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Excessive Bandwidth Consumption Post v36.0 Node Upgrade #231
Comments
@88plug - could you please confirm: 1). Are these nodes being built via Akash Helm Charts? Asking because the Helm Charts sets - 2). During node start up - are there any log entries regarding 0 gas prices? |
Review from additional node operator impacted by increased P2P traffic:
|
It seems that this issue is observed in other networks as well, for example, in the Sentinel Network I'm not sure exactly how this problem arises, but it seems that it spreads through specific peers (full node\RPC) grep "00a39ac3ec012ffa3116a162c17f49df484d0298" .akash/config/config.toml
I'm not sure why this P2P address appears in the address book 123 times 😳 grep "00a39ac3ec012ffa3116a162c17f49df484d0298" .akash/config/addrbook.json | wc -l
123 |
I enabled the sudo journalctl -u akash.service --no-hostname --since "20 minutes ago" | grep -v p2p > akash_20min_log.txt |
Fixes excessive bandwidth #285 I did a battery of tests over the weekend and was able to resolve the issue. The issue appears to be with the p2p seed_mode is set to Cosmos default is pex true and seed mode false. I have updated the Helm charts and tested with seed_mode disabled and the excessive bandwidth issue is resolved. For reference in my testing I also found "error adding vote" will show with 0.025uakt fee. So that may indict some other issue, but it was not related to the bandwidth. |
Issue was caused by IBC relayers allowing zero/very low gas TXs onto the network and into mempool. While Akash RPC/validators are universally configured to reject zero gas TXs, a number of IBC relayers were not configured to reject these TXs. Issue was resolved by: 1). Specific validators intentionally set their min gas requirement to zero to allow these TXs to be written to the chain and thus cleansing the validator mempools of such TXs. 2). Worked with current IBC relayers to ensure they have min gas settings. Network P2P traffic is now normalized. |
@Krewedk0 It's not quite correct.
|
@troian Deleted my last comment to not give bad people good ideas. But i ran some tests last night and you can actually do very nasty stuff with the setup i mentioned. |
Description:
Since upgrading to Akash node release v36.0, nodes have been consuming an unusually high amount of bandwidth, far exceeding the previous usage. This has resulted in "out of bandwidth" notifications across multiple nodes in various datacenters, as well as noticeable lag on residential networks. No changes were made to the default deployment code.
To Reproduce:
Expected Behavior:
The node should have a sustained bandwidth usage of approximately 5,100,000 BPS (5.1 Mbps) for incoming and 6,000,000 BPS (6 Mbps) for outgoing traffic.
Traffic Analysis:
Upon reviewing the provided screenshot:
Attempted Fixes:
I have attempted to limit the P2P connections and adjust the
send_rate
andrecv_rate
parameters in the Cosmos SDK configuration. Despite these efforts, the issue persists.Request:
Please examine the issue deeper and push a fix to stop the irregular bandwidth consumption.
Recommendation:
Anyone running an Akash node should check their bandwidth consumption and traffic to ensure they are not affected by this issue. Create point release for v36.0 that stops excessive bandwidth consumption.
Monthly Traffic View:
Monthly Traffic View:
Before Upgrade Daily:
After Upgrade Daily:
Additional Context:
This issue is critical as it affects the performance and reliability of the nodes across various datacenters and residential networks. Immediate attention and resolution are required.
The text was updated successfully, but these errors were encountered: