feat(MR): [MR-649] Roll out best-effort calls to the first few subnets #3816

Merged


alin-at-dfinity
Contributor

List a few subnets in `BestEffortResponsesFeature::SpecificSubnets` under the `FeatureFlags` default. On these subnets, the `ic0_call_with_best_effort_response()` API call will now actually set a deadline on the calls being constructed, and best-effort requests will be routed between these subnets only.
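
As a rough illustration of the gating described above, here is a minimal sketch. Only the `BestEffortResponsesFeature::SpecificSubnets` name comes from this PR; the `SubnetId` alias, the `is_enabled_on`, `call_deadline` and `may_route_best_effort` helpers, and the plain-seconds deadline are hypothetical and are not taken from the actual code. It also assumes that on unlisted subnets the call simply gets no deadline.

```rust
use std::collections::BTreeSet;

/// Hypothetical stand-in for a real subnet identifier.
type SubnetId = u64;

/// Simplified model of the feature flag. Only the `SpecificSubnets`
/// variant name is taken from the PR description.
#[derive(Clone, Debug, PartialEq, Eq)]
enum BestEffortResponsesFeature {
    SpecificSubnets(BTreeSet<SubnetId>),
}

impl BestEffortResponsesFeature {
    /// True if best-effort calls are rolled out to `subnet`.
    fn is_enabled_on(&self, subnet: SubnetId) -> bool {
        match self {
            Self::SpecificSubnets(subnets) => subnets.contains(&subnet),
        }
    }
}

/// Deadline to set on a call under construction: `Some(now + timeout)` on a
/// listed subnet, `None` (assumed: no deadline is set) everywhere else.
fn call_deadline(
    feature: &BestEffortResponsesFeature,
    own_subnet: SubnetId,
    now_secs: u32,
    timeout_secs: u32,
) -> Option<u32> {
    if feature.is_enabled_on(own_subnet) {
        Some(now_secs.saturating_add(timeout_secs))
    } else {
        None
    }
}

/// Best-effort requests are routed only when both ends are listed subnets.
fn may_route_best_effort(
    feature: &BestEffortResponsesFeature,
    sender: SubnetId,
    receiver: SubnetId,
) -> bool {
    feature.is_enabled_on(sender) && feature.is_enabled_on(receiver)
}
```

With subnets 1 and 2 listed, a call made on subnet 1 gets a deadline and may be routed to subnet 2, while anything involving an unlisted subnet 3 is left without a deadline and is not routed.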

Also reduce the default `subnet_message_memory_capacity` on all subnets from 25 GB to 15 GB, to make room for a potential 5 GB of best-effort messages plus a potential 1M callbacks. Apart from brief spikes, usually associated with heartbeat / global timer storms, subnet message memory usage is virtually always in the low single-digit GB range, so reducing the limit will make no practical difference.
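
The budget arithmetic can be spelled out as follows. This is a back-of-the-envelope sketch, not code from the PR: the constant and function names are hypothetical, "GB" is assumed to mean GiB, and it assumes the 10 GB taken from the old limit is meant to cover both the best-effort messages and the callbacks. The PR does not state a per-callback size; the figure below is only what that assumption implies.

```rust
/// Assumption: "GB" in the description means GiB.
const GIB: u64 = 1 << 30;

const OLD_MESSAGE_MEMORY_CAPACITY: u64 = 25 * GIB;
const NEW_MESSAGE_MEMORY_CAPACITY: u64 = 15 * GIB;
const BEST_EFFORT_MESSAGE_BUDGET: u64 = 5 * GIB;
const MAX_CALLBACKS: u64 = 1_000_000;

/// Memory no longer available to guaranteed-response messages.
fn freed_bytes() -> u64 {
    OLD_MESSAGE_MEMORY_CAPACITY - NEW_MESSAGE_MEMORY_CAPACITY
}

/// What is left of the freed memory per callback once the best-effort
/// message budget is subtracted (integer division, rounds down).
fn per_callback_headroom_bytes() -> u64 {
    (freed_bytes() - BEST_EFFORT_MESSAGE_BUDGET) / MAX_CALLBACKS
}
```

Under these assumptions the reduction frees 10 GiB, of which 5 GiB goes to best-effort messages, leaving a little over 5 KB per callback for 1M callbacks.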

alin-at-dfinity and others added 7 commits January 30, 2025 12:28
Replace the on-off flag with an enum of 4 progressive rollout stages:

 * Stage 0: Trap on related API calls; and reject best-effort requests when routing (status quo).
 * Stage 1: Silently ignore `ic0_call_with_best_effort_response()` calls, falling back to guaranteed response calls; and reject best-effort requests when routing.
 * Stage 2: On system subnets, silently ignore `ic0_call_with_best_effort_response()` calls, falling back to guaranteed response calls; and reject best-effort requests that should be routed to system subnets.
 * Stage 3: Fully enable API calls; always route best-effort requests.

Rollbacks (incremental or direct) are also supported, but it's probably a bad idea to roll back to stage 0, as any canisters already using best-effort calls would break. Badly.
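
The four stages can be modelled roughly as below. This is a sketch for illustration only: the type, variant and function names (`RolloutStage`, `SubnetKind`, `ApiCallOutcome`, `api_call_outcome`, `accepts_best_effort_request`) are hypothetical and not taken from the code. The behavior of non-system subnets in stage 2 is not spelled out in the commit message; the sketch assumes they behave as in stage 3.

```rust
/// Hypothetical names for the four rollout stages listed above.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum RolloutStage {
    Stage0,
    Stage1,
    Stage2,
    Stage3,
}

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum SubnetKind {
    System,
    Application,
}

/// What happens when a canister calls `ic0_call_with_best_effort_response()`.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum ApiCallOutcome {
    Trap,
    FallBackToGuaranteedResponse,
    BestEffort,
}

/// Outcome of the API call on a subnet of kind `own` at the given stage.
fn api_call_outcome(stage: RolloutStage, own: SubnetKind) -> ApiCallOutcome {
    match (stage, own) {
        (RolloutStage::Stage0, _) => ApiCallOutcome::Trap,
        (RolloutStage::Stage1, _) => ApiCallOutcome::FallBackToGuaranteedResponse,
        (RolloutStage::Stage2, SubnetKind::System) => {
            ApiCallOutcome::FallBackToGuaranteedResponse
        }
        // Assumption: non-system subnets in stage 2 are fully enabled.
        (RolloutStage::Stage2, SubnetKind::Application) => ApiCallOutcome::BestEffort,
        (RolloutStage::Stage3, _) => ApiCallOutcome::BestEffort,
    }
}

/// Whether a best-effort request to a subnet of kind `destination`
/// is routed (true) or rejected (false) at the given stage.
fn accepts_best_effort_request(stage: RolloutStage, destination: SubnetKind) -> bool {
    match (stage, destination) {
        (RolloutStage::Stage0, _) | (RolloutStage::Stage1, _) => false,
        (RolloutStage::Stage2, SubnetKind::System) => false,
        (RolloutStage::Stage2, SubnetKind::Application) => true,
        (RolloutStage::Stage3, _) => true,
    }
}
```

Writing the stages as an exhaustive `match` makes the rollback concern visible: moving from any later stage back to `Stage0` turns calls that previously succeeded or fell back into traps.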
@alin-at-dfinity alin-at-dfinity requested a review from a team as a code owner February 6, 2025 09:13
@github-actions github-actions bot added the feat label Feb 6, 2025
Contributor

@derlerd-dfinity derlerd-dfinity left a comment


Thank you. LGTM.

Base automatically changed from alin/MR-649-no-best-effort-calls-to-system-subnets to master February 7, 2025 08:36
@alin-at-dfinity alin-at-dfinity requested review from a team as code owners February 7, 2025 08:36
@alin-at-dfinity alin-at-dfinity force-pushed the alin/MR-649-partially-roll-out-best-effort-calls branch from 2e80716 to 0fc745a Compare February 7, 2025 08:38
@alin-at-dfinity alin-at-dfinity added this pull request to the merge queue Feb 7, 2025
Merged via the queue into master with commit df5828f Feb 7, 2025
20 of 25 checks passed
@alin-at-dfinity alin-at-dfinity deleted the alin/MR-649-partially-roll-out-best-effort-calls branch February 7, 2025 09:37
3 participants