A better test data repository is needed. #215

Open
brettviren opened this issue Apr 19, 2023 · 2 comments

Comments

@brettviren
Member

As a start, the new testing system introduced in #199 uses a "test data repository" that gets built into build/tests/ by downloading tar files from a web site (mine at BNL). This is easy but it means the versioning of the test data files is not tied to the version of WCT. At least two problems arise:

  • Updating data files will break old software versions.
  • It is easy to write/change tests requiring new data and forget to update the test data repo.

A likely solution is to serve the test data repo via git. Four likely options:

  • plain git: this must contend with Git's inherent "large file problem".
  • git-lfs: probably the next simplest, but it results in at least 2 copies of every file on the client side.
  • git-annex: the most efficient, but it has some learning curve.
  • datalad: a simplifying layer on top of git-annex that requires more software, though it is only a pip install away.

Somewhat tied to this choice is how to serve the repo:

  • Use www.phy.bnl.gov, which can accommodate plain git, Gitea + git-lfs, or Gin (a fork of the Gogs server) for git-annex support.
  • Use git-annex via SSH or other remotes.
  • Rely on the free cloud service from Gin for git-annex.
  • Note: GitHub supports git-lfs but the allowed bandwidth under free tier is likely too restrictive.
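
Whichever option is chosen, the underlying requirement is that a given WCT checkout can tell whether the data files it finds are the ones it expects. As a rough illustration only (this is not part of the system from #199), one could imagine a manifest of SHA-256 digests committed alongside the tests. The function names `sha256_of` and `check_manifest` and the manifest layout are assumptions made for this sketch.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the hex SHA-256 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as stream:
        for chunk in iter(lambda: stream.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_manifest(data_dir: Path, manifest: dict[str, str]) -> dict[str, str]:
    """Compare files under data_dir against a {relative name: sha256} manifest.

    Returns {relative name: "missing" or "changed"} for every mismatch.
    An empty result means the data matches what this source version expects.
    """
    problems = {}
    for name, expected in manifest.items():
        path = data_dir / name
        if not path.is_file():
            problems[name] = "missing"
        elif sha256_of(path) != expected:
            problems[name] = "changed"
    return problems
```

A manifest kept in the source tree would change in the same commit as any test that needs new data, so both an updated file breaking an old version and a forgotten upload would show up as a non-empty result rather than as an obscure test failure.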
@brettviren
Member Author

Status update

For now, a trivial, low-feature "repository" is provided by distributing tar files from

https://www.phy.bnl.gov/~bviren/tmp/wcttest/data_repo/

More info at:

https://github.com/WireCell/wire-cell-toolkit/blob/master/test/docs/datarepo.org
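
For readers unfamiliar with the tar-file approach, the following is a minimal sketch of what "download a tar file and unpack it under build/tests/" could look like. It is not the actual WCT build code; the function name `fetch_and_unpack` is an assumption, and the archive name in the usage comment is hypothetical.

```python
import io
import tarfile
import urllib.request
from pathlib import Path


def fetch_and_unpack(url: str, dest: Path) -> list[str]:
    """Download a tar archive from url and unpack it under dest.

    Returns the names of the archive members.
    """
    dest.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(url) as response:
        payload = response.read()
    with tarfile.open(fileobj=io.BytesIO(payload), mode="r:*") as archive:
        names = archive.getnames()
        # Refuse any member whose path would land outside dest.
        root = dest.resolve()
        for name in names:
            target = (root / name).resolve()
            if target != root and root not in target.parents:
                raise ValueError(f"unsafe path in archive: {name}")
        archive.extractall(dest)
    return names


# Hypothetical usage; "example.tar" is a made-up file name:
# base = "https://www.phy.bnl.gov/~bviren/tmp/wcttest/data_repo/"
# fetch_and_unpack(base + "example.tar", Path("build/tests"))
```

Nothing in this sketch records which version of a file was fetched, which is the gap the issue describes.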

@brettviren
Member Author

#338
