Create allennlp-hub repo #3351

matt-gardner · 2019-10-11T19:53:40Z

We're splitting out the models and dataset readers into task-specific repositories, keeping the core abstractions in a more lightweight main library. We still want somewhere to go to get all of the pretrained models, though. We should create a repository called something like AllenNLP Hub that pip installs all of the task-specific repos, then exposes something like our pretrained.py and our sniff tests.

The main repo would then pip install the hub during CI, to run the sniff tests periodically and make sure we're not breaking anything downstream.

- IIUC, this is most of what's needed to make a formal release, but I haven't tested that. Local-only so far.
- Installing dependencies that aren't on PyPI proved clunkier than expected.
- For now I'm going to set the CI to manually run `pip install --editable allennlp` on a local checkout.
- The allennlp specified in `setup.py` will be excluded with an environment variable.
- Prereq for allenai/allennlp#3351.
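
As a rough illustration of the environment-variable exclusion described above, here is a minimal sketch of how a `setup.py` might drop the core `allennlp` requirement when CI has already installed a local checkout. The variable name `EXCLUDE_ALLENNLP_IN_SETUP` and the helper `filter_requirements` are assumptions made for this example, not names taken from the PR.

```python
import os
import re
from typing import List, Mapping

# Hypothetical variable name; the real setup.py may use a different one.
EXCLUDE_VAR = "EXCLUDE_ALLENNLP_IN_SETUP"

# Matches the core "allennlp" requirement (with or without a version
# specifier) but not sibling packages such as "allennlp-semparse".
_CORE_ALLENNLP = re.compile(r"^allennlp(?![\w.-])")


def filter_requirements(
    requirements: List[str], environ: Mapping[str, str] = os.environ
) -> List[str]:
    """Return the requirements, minus core allennlp if the variable is set."""
    if not environ.get(EXCLUDE_VAR):
        return list(requirements)
    return [req for req in requirements if not _CORE_ALLENNLP.match(req.strip())]


if __name__ == "__main__":
    reqs = ["allennlp>=0.9.0", "allennlp-semparse", "torch"]
    # With the variable set, only the core allennlp entry is dropped.
    print(filter_requirements(reqs, {EXCLUDE_VAR: "1"}))
```

The filtered list would then be passed as `install_requires` to `setuptools.setup`, so that a CI job which has already run `pip install --editable allennlp` does not have its checkout replaced by a released version.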

brendan-ai2 · 2019-10-25T05:24:12Z

I've created https://github.com/allenai/allennlp-hub and an associated build for the sniff tests on TeamCity.

The basic functionality works, but there's a bit of cleanup to be done. In particular:

1. Get allennlp-hub properly reviewed. I was merging PRs without reviews to quickly debug TC issues.
2. Find/create sniff tests for allennlp-semparse. Strangely it looks like the semparse models weren't included in pretrained.py even before the sub-repo split, but I may be missing something.
3. Push some broken code in a few different combinations and verify that the new build catches the bug.

brendan-ai2 · 2019-10-25T05:26:05Z

@matt-gardner your advice on 2) would be appreciated! I'm out until Tuesday, but I'll wrap up the remaining pieces then.

matt-gardner · 2019-10-28T16:02:36Z

This is awesome, thanks @brendan-ai2! You're right that we never had sniff tests for the parsing models, and I'm not really sure why not.

As for adding things, it would be ideal if we didn't have to modify this repo in order to add a new semantic parser to the hub, just add something to a pretrained.py in allennlp-semparse, or something. That gets a little magical, though, and might not be easy to accomplish.

One option is to do a named approach, similar to "bert-base-cased", where you call hub.get_model(subrepo, model_name). You don't get the benefits of types and autocomplete when doing it that way, though, which is a nice feature of our current pretrained.py.
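
To make the trade-off concrete, here is a minimal sketch of the two styles side by side. Everything in it is hypothetical: `register`, `get_model`, `FakeParser`, and the model name `"example-parser"` are stand-ins invented for illustration, not the hub's actual API.

```python
from typing import Callable, Dict, Tuple

# Hypothetical registry mapping (subrepo, model_name) to a loader function.
_REGISTRY: Dict[Tuple[str, str], Callable[[], object]] = {}


def register(subrepo: str, model_name: str):
    """Decorator that lets a sub-repo add a loader under a string name."""
    def decorator(loader: Callable[[], object]) -> Callable[[], object]:
        _REGISTRY[(subrepo, model_name)] = loader
        return loader
    return decorator


def get_model(subrepo: str, model_name: str) -> object:
    """Named lookup: flexible, but the return type is just `object`."""
    try:
        loader = _REGISTRY[(subrepo, model_name)]
    except KeyError:
        known = ", ".join(f"{s}/{m}" for s, m in sorted(_REGISTRY))
        raise KeyError(
            f"Unknown model {subrepo}/{model_name}; known models: {known}"
        ) from None
    return loader()


class FakeParser:
    """Stand-in for a real predictor class (assumption for this sketch)."""
    def predict(self, question: str) -> str:
        return f"parsed: {question}"


@register("semparse", "example-parser")
def example_parser() -> FakeParser:
    """Typed function in the pretrained.py style."""
    return FakeParser()


# Typed style: an editor knows this is a FakeParser and can autocomplete.
typed = example_parser()

# Named style: new models need no new function in the hub, but the caller
# has to know (or cast to) the concrete type.
named = get_model("semparse", "example-parser")
```

With a registry like this, a sub-repo could add models without touching the hub, at the cost of the static return types that typed functions provide.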

Probably the right thing to do is just put a few existing models (e.g., ones that were used for papers) into pretrained.py, without worrying about setting something similar up in each sub repo. This is what you've already put in here, it just means adding in a few models from the semparse repo. Probably the right ones are the models that are serving the semantic parsing demos that we have.

- For allenai/allennlp#3351.
- Conveniently allenai/allennlp#3361 broke `allennlp_semparse` a while back, so the [AllenNLP Hub Master Build](http://build.allennlp.org/viewType.html?buildTypeId=AllenNLPHub_Master) should break when this PR is merged.
- We should then fix `allennlp-semparse` and verify that the build goes green.

- Ensures that the versions of `allennlp` and `allennlp-semparse` specified in `requirements.txt`/`setup.py` are compatible.
- Corresponding build: http://build.allennlp.org/viewType.html?buildTypeId=AllenNLPHub_Release
- The existing dockerfile and TC build work only on the various `master`s.
- For allenai/allennlp#3351

matt-gardner · 2020-01-03T17:21:58Z

@brendan-ai2, can this be closed now?

schmmd · 2020-01-06T21:28:57Z

See github.com/allenai/allennlp-hub

matt-gardner added this to the 1.0.0 milestone Oct 11, 2019

brendan-ai2 self-assigned this Oct 11, 2019

brendan-ai2 mentioned this issue Oct 11, 2019

Remove semantic parsing code #3207

Merged

brendan-ai2 mentioned this issue Oct 24, 2019

Add setup.py so we're pip installable. allenai/allennlp-semparse#10

Merged

This was referenced Nov 2, 2019

Add pretrained models and sniff tests for allennlp_semparse. allenai/allennlp-hub#4

Merged

Add ROADMAP.md to share our quarterly plans publicly. #3427

Merged

brendan-ai2 mentioned this issue Nov 15, 2019

Add Dockerfile.release allenai/allennlp-hub#5

Merged

schmmd closed this as completed Jan 6, 2020
