Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GitHub target: reduce memory consumption by streaming uploads/downloads #329

Open
rhcarvalho opened this issue Nov 18, 2021 · 2 comments
Open

Comments

@rhcarvalho
Copy link
Contributor

rhcarvalho commented Nov 18, 2021

The GitHub target should never need to hold full release assets (nor any other potentially large files) in memory.

This issue tracks replacing code that reads full files from disk, downloads arbitrary files from the internet and calculate checksums. All of those steps can be performed with fixed size buffers, consuming less resources and eliminating a failure mode of running out of memory.

See discussion in #328 (comment).

Note: this might require changes to how we use the Octokit library.


Some of the candidates

Reading full files of arbitrary size to memory where we could possibly be stream uploading / calculating checksum:

const file = readFileSync(path);

Calculating MD5 hashes from in-memory bytes where we could be updating the checksum chunk-by-chunk:

craft/src/targets/github.ts

Lines 451 to 453 in 244efd8

private checksumFromData(data: BinaryLike): string {
return createHash('md5').update(data).digest('hex');
}

Downloading arbitrarily large assets from the Internet (into response.data) where we could read chunks and update a checksum:

response = await this.github.request(`GET ${url}`, {

@rhcarvalho
Copy link
Contributor Author

I've put down in the issue description some of the lines of code that I've recently noticed implicitly assuming small release assets.

A more detailed review of the GitHub target might reveal more places where the assumption is made.

A similar problem might exist in other targets.

As far as I know, this problem has not manifested yet in any release of our projects.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant