Tracking issue for impact of 2024 MacOS fleet expansion #1981
FYI @keith, I'm not sure whether the latest rules_apple failures are related to this: https://buildkite.com/bazel/rules-apple-darwin/builds/9311
Yeah, they are. It looks like there was an Xcode update as part of this, which is good. Tracking the fix here: bazelbuild/rules_apple#2488
There are some BEP-related errors in this presubmit that only happen on macOS and don't go away after retries: https://buildkite.com/bazel/bcr-presubmit/builds/6541#019092db-1572-49fe-b7e3-2f489764d67a
It looks like rules_jvm_external is having trouble on the new Macs with whatever version of Java they have:
I can try updating rules_jvm_external; it looks like the latest version doesn't have this problem.
I also noticed that the Mac arm64 tests on rules_android are failing with "Bad CPU type in executable" when they try to execute aapt2. I haven't looked into that further.
Hmm. I'm also seeing macOS jobs on Bazel 6.x stuck on
bazelbuild/rules_android#244 fixes the
OK, so I downloaded the Android SDK build tools for Mac, examined the currently installed version, and then examined the latest version, 35.0.0. The currently installed 30.0.3 has only an x86_64 binary, while the latest version contains both x86_64 and arm64. The arm64 tests were passing before, but I'm not sure how or why. I guess there was some x86_64 emulation going on that's not enabled on the new runners? In any case, we should just update the Android SDK build tools on the CI images (https://github.com/bazelbuild/continuous-integration/blob/master/macos/mac-android.sh etc.) to the latest build tools, 35.0.0.
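For anyone who wants to repeat the architecture check without extra tooling, here is a minimal sketch that reads the CPU types out of a Mach-O header. It only covers thin 64-bit and universal (fat) binaries, and the function name `macho_architectures` is made up for this illustration; it is not what was actually run in this thread.

```python
import struct

# CPU type constants as defined in Apple's <mach/machine.h>.
CPU_TYPES = {0x01000007: "x86_64", 0x0100000C: "arm64"}

FAT_MAGIC = 0xCAFEBABE     # universal binary header, 32-bit offsets (big-endian)
FAT_MAGIC_64 = 0xCAFEBABF  # universal binary header, 64-bit offsets (big-endian)
MH_MAGIC_64 = 0xFEEDFACF   # thin 64-bit Mach-O (stored little-endian on disk)


def macho_architectures(data: bytes) -> list:
    """Return the CPU architectures named in the header of a Mach-O file.

    `data` only needs to hold the start of the file (a few KB is plenty).
    Note: Java class files share the 0xCAFEBABE magic, so only call this on
    files that are expected to be Mach-O binaries.
    """
    if len(data) < 8:
        raise ValueError("too short to be a Mach-O file")
    (magic_be,) = struct.unpack_from(">I", data, 0)
    if magic_be in (FAT_MAGIC, FAT_MAGIC_64):
        (count,) = struct.unpack_from(">I", data, 4)
        # fat_arch entries are 20 bytes, fat_arch_64 entries are 32 bytes;
        # the CPU type is the first 4-byte field of each entry.
        entry_size = 20 if magic_be == FAT_MAGIC else 32
        archs = []
        for i in range(count):
            (cputype,) = struct.unpack_from(">I", data, 8 + i * entry_size)
            archs.append(CPU_TYPES.get(cputype, hex(cputype)))
        return archs
    (magic_le,) = struct.unpack_from("<I", data, 0)
    if magic_le == MH_MAGIC_64:
        (cputype,) = struct.unpack_from("<I", data, 4)
        return [CPU_TYPES.get(cputype, hex(cputype))]
    raise ValueError("not a recognized 64-bit or universal Mach-O header")
```

To use it, pass the first few KB of the binary, e.g. `macho_architectures(open(path, "rb").read(4096))`, where `path` points at the aapt2 binary inside whichever build-tools directory is installed.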
Probably unrelated to the macOS migration; see bazelbuild/bazel-central-registry#2373 (comment)
If your builds on Intel macOS frequently run into timeouts like bazel-contrib/rules_go#3969 (comment) after this change, this is because the machines in the new fleet are less powerful than the previous ones (2-core Mac mini vs. 20-core iMac Pro). Consider applying the following changes:
This is caused by
rules_foreign_cc is having issues with the new macOS runners too: https://buildkite.com/bazel/rules-foreign-cc/builds/5745#_
Probably mesonbuild/meson#12282?
It looks like it, but the fix is probably something that the Apple toolchain should apply, as rules_foreign_cc can't really know the Xcode version and when it's needed. Any opinions, @keith?
…sue on new mac CI machines. See bazelbuild/continuous-integration#1981 (comment) Closes #244 Closes #242 PiperOrigin-RevId: 650660707 Change-Id: I3e93a893684786951e1f345ee9749ee62ec7049e
Hmm, it seems like the repo's files should deal with this if they need it? Theoretically you could know what the current Xcode version was if you wanted, but unless you're setting that version wrong today, I don't think that would help with downstream issues?
- Upgraded Bazel to 7.2.1 to include remote cache fixes - Backport changes to adapt bazelbuild/continuous-integration#1981 --------- Co-authored-by: Googler <[email protected]>
https://buildkite.com/bazel/bcr-presubmit/builds/6641#01909d90-e33f-49cf-ba58-fcd588073673/415-445
Any chance I could get some help with one of the macOS failures on my PR? It seems like it's probably related to this issue. Please let me know if there are any configurations I should be changing here. This is my first contribution here, so any advice is welcome! bazel-contrib/bazel-gazelle#1822
Likely unrelated. I don't have access to retry it, but hopefully someone can.
@lkassar-stripe The timeout is likely caused by the Bazel binary being unable to reach the network in an IPv6-only network. Unfortunately, the test framework needs to inject
@coeuvre Do you know what's happening? I'm also seeing
from time to time.
It's hard to tell from this error message. Do you have a link to the build that produced the error?
Yes, this one: #1981 (comment)
@meteorcloudy I am having a problem in Bazel.git as well. Currently, to run a shell test on my Mac, I have to apply this patch:
This is the test command I am running on my Mac:
Without commenting out the IPv6 flags, I get
Yeah, this is the same problem. We probably should also remove /cc @fweikert
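When debugging this kind of IPv4 vs. IPv6 mismatch, it can help to see which address families a machine's resolver returns for a given host. The snippet below is only a rough diagnostic sketch: the helper names `address_families` and `resolve_families` are made up here, and a successful lookup does not prove that the network is actually reachable over that family.

```python
import socket


def address_families(addrinfos) -> set:
    """Map getaddrinfo()-style results to the set of {"IPv4", "IPv6"} they cover."""
    names = {socket.AF_INET: "IPv4", socket.AF_INET6: "IPv6"}
    # Each result is a 5-tuple whose first element is the address family.
    return {names[info[0]] for info in addrinfos if info[0] in names}


def resolve_families(host: str, port: int = 443) -> set:
    """Resolve `host` and report which address families the resolver returns."""
    results = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    return address_families(results)
```

Running `resolve_families(...)` with the host that a test downloads from, on both a CI VM and a local Mac, gives a quick hint about whether the two environments differ in the way described above.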
I wonder if it's a good opportunity to make this a platform constraint somehow and augment the tests with a
Filed #2003 to track the Intel macOS problem with test analytics (caused by this migration).
Possible macOS issue here: https://buildkite.com/bazel/bazel-bazel-github-presubmit/builds/22496#0190dcaf-a704-424e-9441-dd7426239485
I'm also getting a new persistent issue on a specific macOS shard:
I did try to push another commit to re-trigger CI for this PR, but the same issue happened again.
And it happened on the same machine, bk-macos-pln3-90qy (build 1, build 2). @fweikert Can we report such errors back to the internal team offering the VMs?
For my builds, a few of the macOS ones failed with
@FaBrand We are deploying an update to the VMs. Sorry for the inconvenience.
Thanks for the info, @meteorcloudy :) Do I have to do anything to retrigger my checks?
There is already a retry job scheduled for each lost agent; they will run as soon as the VMs are online.
I'm not sure if it's related to the new Mac CI workers, but as this issue is prominently featured on Buildkite, I'll report it here. If you think it's unrelated, I'm happy to report it as a separate issue elsewhere. I'm hitting
on https://buildkite.com/bazel/bazel-bazel-github-presubmit/builds/22520#0190e1c7-b7fe-4b6b-934c-624bee706f7d, which is an exception I've never come across before.
@meteorcloudy FYI, there is one shard that wasn't retried after 4 agent losses:
I guess someone retriggered it. Thank you, dear anonymous helper 💚
bazel/buildtools:
We need the fix from bazelbuild/bazel#18444, which is only included in 7.3.0. We can pin the Bazel version for buildtools to 7.3.0rc1. /cc @vladmos
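If you want to check from a script whether a pinned Bazel version is new enough, a small comparison helper along these lines might do. This is just a sketch: it assumes versions look like `7.3.0` or `7.3.0rc1`, it takes 7.3.0rc1 as the first version carrying the fix (based on the comment above), and `parse_bazel_version` and `has_fix` are hypothetical names.

```python
import re


def parse_bazel_version(version: str) -> tuple:
    """Turn '7.3.0' or '7.3.0rc1' into a tuple that sorts release candidates first."""
    match = re.fullmatch(r"(\d+)\.(\d+)\.(\d+)(?:rc(\d+))?", version.strip())
    if match is None:
        raise ValueError(f"unrecognized Bazel version: {version!r}")
    major, minor, patch, rc = match.groups()
    # A final release has no rc suffix; rank it above every rc of the same version.
    rc_rank = int(rc) if rc is not None else float("inf")
    return (int(major), int(minor), int(patch), rc_rank)


def has_fix(version: str, first_fixed: str = "7.3.0rc1") -> bool:
    """True if `version` is at or after `first_fixed` (an assumption, see above)."""
    return parse_bazel_version(version) >= parse_bazel_version(first_fixed)
```

The version string could come from a `.bazelversion` file, for example `has_fix(open(".bazelversion").read())`.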
I'm facing the following build error on macOS, related to this merge request:
CI infra problems for the new macOS fleet should have been addressed already, so I'm closing this one for now.
This marks the end of bazelbuild#1981
We've moved most of our MacOS CI workload from lab machines to a larger number of less powerful VMs (#1708).
Please reply to this issue if you've encountered any MacOS issues related to the migration.