Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Source distribution does not observe the file/path excludes and is too big #294

Closed
1 task done
ceteri opened this issue Nov 24, 2022 · 7 comments · Fixed by #297
Closed
1 task done

Source distribution does not observe the file/path excludes and is too big #294

ceteri opened this issue Nov 24, 2022 · 7 comments · Fixed by #297
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@ceteri
Copy link
Collaborator

ceteri commented Nov 24, 2022

I'm submitting a

  • bug report (CRITICAL)

This subsumes #275

Currently when using either pyproject.toml or setup.py approaches to build source and wheel, the wheel for a release is ~400Kb while the source distribution is ~300Mb, which is too large for PyPi.

Note that the former was moved to wip/pyproject.toml since its use with Flit appeared to be bugging. Perhaps Poetry works better?

See the MANIFEST.in file, where there are both include and exclude statements getting used. For some reason, these rules are followed when building a wheel, but not when building a source distribution (sdist)

@Mec-iS @tomaarsen @jake-aft - do you have any suggestions about the library packaging here could be structured better for avoid this issue?

@ceteri ceteri added bug Something isn't working help wanted Extra attention is needed labels Nov 24, 2022
@jake-aft
Copy link
Contributor

jake-aft commented Nov 24, 2022

@ceteri i have pushed up a WIP of the poetry conversion. So there is a pyproject.toml set up for poetry, but have not got to the point to convert to build the package as yet.

@ceteri
Copy link
Collaborator Author

ceteri commented Nov 24, 2022

Many thanks @jake-aft

I have a hunch that the Python docs somehow change substantially between 3.8 and latest online (3.11) so that the reserved words here are different: https://docs.python.org/3/distutils/commandref.html#sdist-cmd

@ceteri
Copy link
Collaborator Author

ceteri commented Nov 24, 2022

For whatever reason the "egg" part of a source distribution ignores the MANIFEST.in rules and just decides on its own to include every file in the directory, recursively, which produces a sdist that's too large for PyPi to upload. This is with setup.py

I've tried to use pyproject.toml instead, however I cannot get it to follow any of the rules as advertised. We'll need to fix this, somehow.

@jake-aft
Copy link
Contributor

@ceteri maybe its silly, but could we chmod the dirs we want to exclude and make them not readable, and then build the package? Not a great thing, but maybe for a workaround if we need to get the build out?

@ceteri
Copy link
Collaborator Author

ceteri commented Nov 24, 2022

Instead of directory permissions, tt'd probably be good if the SPARQL tests could pull down benchmark data from another source, instead of having that all be included in the repo.

That said, there must be ways to exclude paths and files from a source distribution. I've been testing, but there are changes in the Python build specs between 3.8 and 3.10-3.11, which is right where pyproject.toml begins to be required by pip v23+

Something between the docs' diffs and my brain and the Py tools impl just doesn't quite work as advertised? :)

@ceteri
Copy link
Collaborator Author

ceteri commented Nov 24, 2022

@jake-aft if poetry doesn't resolve this,
i can check in with Python Software Foundation, where they've been having some "debates" regarding this topic, IIRC ...

@ceteri
Copy link
Collaborator Author

ceteri commented Nov 24, 2022

Found the issue, which is explained here:
https://stackoverflow.com/questions/608855/excluding-a-top-level-directory-from-a-setuptools-package

TL;DR:
rm -rf dist build *.egg-info
prior to running a build for a new release

In our case, this results in 10:1 reduction of the source distribution.
Entirely not documented, AFAICT

@ceteri ceteri linked a pull request Nov 24, 2022 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants