Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Speed up rolling-update - longer timeout on validation, less scheduled holds #6747

Merged
merged 1 commit into from
May 20, 2019

Conversation

justinsb
Copy link
Member

@justinsb justinsb commented Apr 7, 2019

We reduce the amount of time we wait on a purely scheduled basis
during a rolling-update, and instead increase the timeout on the
validation.

This will be a behavioural change - particularly for --cloudonly
mode, but that mode seems more useful when things are going wrong
anyway, when people will likely set even lower timeouts.

@k8s-ci-robot k8s-ci-robot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/S Denotes a PR that changes 10-29 lines, ignoring generated files. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Apr 7, 2019
@justinsb
Copy link
Member Author

justinsb commented Apr 7, 2019

WIP because this shouldn't go into 1.12, and will need a release note.

Copy link
Member

@geojaz geojaz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 lgtm
i'm into this kind of shift in rolling update timing. this looks great on my side-- as long as we sufficiently communicate to users!

should we let this simmer behind a feature flag for a version? or are release notes + a friendly log enough? i'd like to think that this change would never break my workflows, but i can imagine discovering a whole new set of subtle race conditions- which we should fix anyway.

i think my vote would be to disclose in release notes, mainline the change asap, and pray that alpha/beta release catches anything funky. perhaps my reaction is remembering early days of kops :)

@justinsb justinsb changed the title WIP: Speed up rolling-update - longer timeout on validation, less scheduled holds Speed up rolling-update - longer timeout on validation, less scheduled holds May 16, 2019
@k8s-ci-robot k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label May 16, 2019
…d holds

We reduce the amount of time we wait on a purely scheduled basis
during a rolling-update, and instead increase the timeout on the
validation.

This will be a behavioural change - particularly for `--cloudonly`
mode, but that mode seems more useful when things are going wrong
anyway, when people will likely set _even lower_ timeouts.
@k8s-ci-robot k8s-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels May 16, 2019
@justinsb
Copy link
Member Author

Removed WIP, and added release notes for 1.15

I'm using this for my development now and it seems pretty good, and I think we should target this at 1.15 to give it more time to bake. The early kops code simply didn't have any validation logic (was purely time based) and then we hadn't shaken out most of the validation bugs. I'm sure there are more, but I think we've got the big ones!

Copy link
Contributor

@mikesplain mikesplain left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is great and looks like it's ready now!

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 20, 2019
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: justinsb, mikesplain

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [justinsb,mikesplain]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot merged commit 81c42c5 into kubernetes:master May 20, 2019
@granular-ryanbonham
Copy link
Contributor

We are using these timings with all of our kops 1.12 updates. They have been working great. I would vote to merge this in sooner then 1.15

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants