-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Expand locking to entire pipeline #620
base: main
Are you sure you want to change the base?
Conversation
I'm not sure this does what we want - by my read this change means that:
I think in practice, this just means that once deploy-master-bosh runs, one of the other jobs has to fail, or deploy-production-bosh has to succeed, before deploy-master-bosh can run again. I think what we want is for only one bosh to deploy at a time - to do that, I think we need to get the lock in each job and release it in each job |
In practice, doesn't the pipeline ordering make it unlikely that multiple BOSHes are deploying at the same time? From the first linked thread, I thought the problem we were tackling with locks was Concourse & BOSH deploying at the same time — or at least, digging into @cweibel's messages, tooling BOSH and either Concourse deploying at the same time. (I say "unlikely" — it's certainly not impossible, without locking as you describe. And I'm going off secondhand info so I could be wrong.) |
one would think, but in practice, it seems to happen fairly often that they deploy multiple times back and forth. I think this is because often several triggers will hit around the same time (a new stemcell + a new version of bosh, for example) Another concern I had in mind in my first comment, but didn't spell out, is that we'll often pause production-bosh for a few different reasons:
In these cases, I don't think we want to lock everything up until prod bosh rolls or a lower bosh fails. My suggestion above fixes that (but it's not the only fix for it - for example, we could just have an
Concourse and BOSH deploying at the same time is one of the noisier failures, but in a perfect world, I'd like no deployment (e.g., |
@bengerman13 @jameshochadel I'd really like to get some form of these changes through to avoid alert fatigue and spurious CI failures Could we have one lock per BOSH deployment (e.g. tooling, development, staging, prod)? Would that address your concerns @bengerman13 ? As long as we had a lock for say when staging BOSH is deploying, then I could prevent OpenSearch from trying to deploy at the same time and failing |
Changes proposed in this pull request:
security considerations
None.