Remove model caching mechanism for bert and hbert #42

achyudh · 2019-11-01T02:03:25Z

Fixes issue #9.

* Fix package imports * Update README.md * Fix bug due to TAR/AR attribute check * Add BERT models * Add BERT tokenizer * Return logits from the model.py * Remove unused classes in models/bert * Return logits from the model.py (#12) * Remove unused classes in models/bert (#13) * Add initial main file * Add args for BERT * Add partial support for BERT * Initialize training and optimization * Draft the structure of Trainers for BERT * Remove duplicate tokenizer * Add utils * Move optimization to utils * Add more structure for trainer * Refactor the trainer (#15) * Refactor the trainer * Add more edits * Add support for our datasets * Add evaluator * Split data4bert module into multiple processors * Refactor BERT tokenizer * Integrate BERT into Castor framework (#17) * Remove unused classes in models/bert * Split data4bert module into multiple processors * Refactor BERT tokenizer * Add multilabel support in BertTrainer * Add multilabel support in BertEvaluator * Add get_test_samples method in dataset processors * Fix args.py for BERT * Add support for Reuters, IMDB datasets for BERT * Revert "Integrate BERT into Castor framework (#17)" This reverts commit e4244ec. * Fix paths to datasets in dataset classes and args * Add SST dataset * Add hedwig-data instructions to README.md * Fix KimCNN README * Fix RegLSTM README * Fix typos in README * Remove trec_eval from README * Add tensorboardX to requirements.txt * Rename processors module to bert_processors * Add method to print metrics after training * Add model check-pointing and early stopping for BERT * Add logos * Update README.md * Fix code comments in classification trainer * Add support for AAPD, Sogou, AGNews and Yelp2014 * Fix bug that deleted saved models * Update README for HAN * Update README for XML-CNN * Remove redundant TODOs from the READMEs * Fix logo in README.md * Update README for Char-CNN * Fix all the READMEs * Resolve conflict * Fix Typos * Re-Add SST2 Processor * Add support for evaluating trained model * Update args.py * Resolve issues due to DataParallel wrapper on saved model * Remove redundant Yelp processor * Fix bug for safely creating the saving directory * Change checkpoint paths to timestamps * Remove unwanted string.strip() from tokenizer * Create save path if it doesn't exist * Decouple model checkpoints from code * Remove model choice restrictions for BERT * Remove model/distill driver * Simplify checkpoint directory creation

# Conflicts: # datasets/reuters.py # models/mlp/args.py # models/mlp/model.py

Fixes issue #9

achyudh and others added 30 commits April 13, 2019 23:25

Resolve conflicts in the dev fork

cb14201

Merge branch 'karkaroff-master'

8346514

Resolve merge conflicts in README.md

fff8e0a

Add TREC relevance datasets

0979f77

Add relevance transfer trainer and evaluator

e5f2ee0

Add re-ranking module

57f0680

Add ImbalancedDatasetSampler

7d26d71

Add relevance transfer package

eab4fc2

Fix import in classification trainer

a08b2d1

Merge remote-tracking branch 'castorini/master'

cb3ca31

Remove unwanted args from models/bert

0890eae

Merge remote-tracking branch 'castorini/master'

a8de77c

Fix bug where model wasn't in training mode every epoch

1116c64

Merge remote-tracking branch 'castorini/master'

8c36691

Add Robust45 preprocessor for BERT

0f34aa0

Add support for BERT for relevance transfer

7bed0f1

Add hierarchical BERT model

6c8c728

Remove tensorboardX logging

615fa27

Add hierarchical BERT for relevance transfer

b40cccb

Merge remote-tracking branch 'castorini/master'

70ec667

Add learning rate multiplier

1b031a8

Merge branch 'master' of github.com:castorini/hedwig

a987e2c

Add lr multiplier for relevance transfer

e81cfff

Add MLP model

4758607

Add fastText model

289cde0

Add Reuters bag-of-words dataset class

12a09da

Add input dropout for MLP

bcf1dca

Merge branch 'master' of github.com:castorini/hedwig

7aeded5

# Conflicts: # datasets/reuters.py # models/mlp/args.py # models/mlp/model.py

Remove duplicate README files

448b087

Remove model caching mechanism for bert and hbert

71a2df3

Fixes issue #9

achyudh requested a review from daemon November 1, 2019 02:03

Merge branch 'master' of github.com:castorini/hedwig

7899780

daemon approved these changes Nov 1, 2019

View reviewed changes

daemon merged commit 00f5f99 into castorini:master Nov 1, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove model caching mechanism for bert and hbert #42

Remove model caching mechanism for bert and hbert #42

achyudh commented Nov 1, 2019

Remove model caching mechanism for bert and hbert #42

Remove model caching mechanism for bert and hbert #42

Conversation

achyudh commented Nov 1, 2019