Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include Github as a download mirror #844

Open
Chocobo1 opened this issue Jan 1, 2024 · 10 comments
Open

Include Github as a download mirror #844

Chocobo1 opened this issue Jan 1, 2024 · 10 comments

Comments

@Chocobo1
Copy link

Chocobo1 commented Jan 1, 2024

Currently there are only 2 download site for boost. One is jfrog which is still unavailable at the time of current writing. Another one is sourceforge which is slow and I had faced reliability issues in the past.
I propose to utilize Github as another (backup) download site.

Boost is already putting release artifacts on github: https://github.com/boostorg/boost/releases
The script for creating those artifacts is here: https://github.com/boostorg/boost/blob/master/.github/workflows/release.yml
However those artifacts on github are not the same as from jfrog or sourceforge, they have different file tree layout and different hashes. Please consider fixing them or upload the official ones.

ps. nowadays Github Actions has non trivial amount of CI jobs running (for various projects) and those CI jobs would benefit (in download speed and reliability) if boost can be fetched over the same github domain.

@mclow
Copy link
Contributor

mclow commented Jan 1, 2024

The Boost libraries are incredibly popular, and are downloaded 1000s of times each day.
Before we put up any other mirrors, we need to discuss it with the providers to (at the very least) let them know what we are doing.

If you know someone who is willing to donate several 100 TB of bandwidth/month, we'd love to talk to them.

Hopefully, the JFrog links will be restored soon.

@valiko-ua
Copy link

valiko-ua commented Jan 2, 2024

As an alternative, consider splitting archives:

  • sources only
  • documentation only
  • tests only
  • built binaries for specific platform (this seems already done)

Most users need just the first one. This will greatly decrease required bandwidth.

@userdocs
Copy link

userdocs commented Jan 5, 2024

First request of 2024 to use Github releases for boost. 🥳

At any point will you actually ask Github instead of using this copy and paste reply

This is such a weird flex to take that somehow your release assets will cripple Github and you are so afraid of some perceived repercussion you never actually resolve the issue.

Caution

They probably won't even notice this repo exists even with the release assets for INTERNAL traffic to other repos using the assets in actions. You act like they will put you in jail.

Either:

  • ASK THEM, which will probably just amount to a discussions topic with no commitment from them.
  • or just do it and see what happens.

Really, what is the worst that will happen? They ask you to stop using releases? At least you'll then have a valid reason.

After the numerous previous attempts at a reasonable outcome to this subject some things occur to me:

1: I still have no idea why you think that amount of transfer is a lot, especially to Github. Really, it's not.
2: The majority of traffic will be internal network traffic used in workflows, This is not the burden you make it out to be.
3: They probably won't bite if you ask or care if you do it without asking them.
4: There are much bigger projects on Github doing it, why are you special?
5: I can just host it myself on Github and move on.

And that is exactly what I have been doing for some time as externally hosted assets are unreliable for workflows and using Github to track my dependencies is far mor reliable.

https://github.com/userdocs/qbt-workflow-files/releases/latest

I just came here to lol at the reply. It's just silly at this point.

@userdocs
Copy link

userdocs commented Jan 7, 2024

First, part of the OP's issue was never addressed so i'll link to me asking it (why the release assets are different)
#753 (comment)

Second, I made an automated mirror of the https://archives.boost.io/release/ assets as it really is that simple.

https://github.com/userdocs/boost/releases/latest

Third: How many times does boost need to have cdn drama for this it not be an issue?

@Chocobo1
Copy link
Author

Chocobo1 commented Jan 7, 2024

JFrog is down again...

Here is the relevant documentation from Github about bandwidth usage:

Storage and bandwidth quotas
Each file included in a release must be under 2 GiB. There is no limit on the total size of a release, nor bandwidth usage.

I would take it literally unless they say it isn't.

Before we put up any other mirrors, we need to discuss it with the providers to (at the very least) let them know what we are doing.

Sure, it is a nice thing to do. But at the same time I'm not sure there is actually interest in Boost to take this step.
Anyway, here is the form to submit a support request: https://support.github.com/contact?tags=rr-general-technical

@edmcman
Copy link

edmcman commented Jan 7, 2024

Or use jsdelivr?

@userdocs
Copy link

userdocs commented Jan 7, 2024

Or use jsdelivr?

As a recommendation for a backup mirror perhaps but the point that should be focused on here is that there is no valid to reason to avoid using Github as the main release platform and mirror that out to backup cdn solutions. The reason(s) constantly given as to why it is not/has not been done are simply out of touch, wrong and misleading. I thought they were dumb 2 years ago and now I think it's a meme.

Github's own TOS, platform documentation and comparable projects directly support the idea it can be done and contradict claims that boost is some unreasonable burden on the platform.

It's also not really a good reflection on the project to be constantly mired by this cdn drama when they could just use Github because that is what the majority of people need. To use boost in their Github Workflows.

The only sane outcome here is that the proposal is accepted and road mapped so no one ever has to wonder if their boost workflow is a coinflip.

@userdocs
Copy link

As my final contribution to this spectacle I thought I'd expand the potential of the demo mirror repo to be an easy to fork and reproduce mirror system that can either just host source archives or both source and binary archives, depending on what's needed. I saw 2 people download the binary files so figured I had to account for them.

https://github.com/userdocs/boost

Though it's a proof of concept mirror repo it is fully functional, automated and will probably just work to mirror future releases as they are released so I'll just leave it there and see what happens.

Regardless, I don't have to worry about future cdn issues, when it happens again.

@dimpase
Copy link

dimpase commented Mar 19, 2024

I've never heard of any project afraid of mirroring small, by modern standards, tarballs on GitHub.

@trim21
Copy link

trim21 commented Jan 1, 2025

Or use jsdelivr?

jsdelivr have a hard limit at 50mib and it only serve files in npm or github repo.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants