Smart classnum #29

j-cahill · 2019-08-04T20:28:00Z

added smart processing of the number of classes to Reuters

Fixed paths to include 'models/'

Learning curves

Cuda fix

Add lyrics.py and lyrics_processor.py

Changed class number to correct number, 10

try at fixing cuda usage

Cuda fix

removed local rank argument

Num classes
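The "smart processing of the number of classes" mentioned above is not shown on this page. A minimal sketch of the idea follows, assuming each dataset row is `label<TAB>text` and the label is a fixed-width string of 0/1 flags, one per class. The function name `infer_label_info` and the sample rows are hypothetical, not taken from the PR.

```python
import csv


def infer_label_info(tsv_lines):
    """Return (num_classes, is_multilabel) inferred from 'label<TAB>text' rows.

    Assumption: the label column is a string of 0/1 flags, one per class.
    """
    num_classes = None
    is_multilabel = False
    reader = csv.reader(tsv_lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # Skip blank lines and rows with an empty label column.
        if not row or not row[0].strip():
            continue
        label = row[0].strip()
        if set(label) - {"0", "1"}:
            raise ValueError(f"unexpected label format: {label!r}")
        if num_classes is None:
            # The width of the first label fixes the class count.
            num_classes = len(label)
        elif len(label) != num_classes:
            raise ValueError("labels have inconsistent widths")
        # More than one active flag in any row means a multilabel task.
        if label.count("1") > 1:
            is_multilabel = True
    if num_classes is None:
        raise ValueError("no labelled rows found")
    return num_classes, is_multilabel


if __name__ == "__main__":
    sample = ["0100\tsome text", "1010\tother text"]
    print(infer_label_info(sample))  # prints (4, True)
```

Reading the class count from the data rather than hard-coding it avoids the kind of mismatch that the "Changed class number to correct number, 10" commit had to fix by hand.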

j-cahill · 2019-08-04T20:29:37Z

Sorry, I meant to merge this into the master branch of my fork.

* Integrate BERT into Hedwig (#29)
* Fix package imports
* Update README.md
* Fix bug due to TAR/AR attribute check
* Add BERT models
* Add BERT tokenizer
* Return logits from the model.py
* Remove unused classes in models/bert
* Return logits from the model.py (#12)
* Remove unused classes in models/bert (#13)
* Add initial main file
* Add args for BERT
* Add partial support for BERT
* Initialize training and optimization
* Draft the structure of Trainers for BERT
* Remove duplicate tokenizer
* Add utils
* Move optimization to utils
* Add more structure for trainer
* Refactor the trainer (#15)
* Refactor the trainer
* Add more edits
* Add support for our datasets
* Add evaluator
* Split data4bert module into multiple processors
* Refactor BERT tokenizer
* Integrate BERT into Castor framework (#17)
* Remove unused classes in models/bert
* Split data4bert module into multiple processors
* Refactor BERT tokenizer
* Add multilabel support in BertTrainer
* Add multilabel support in BertEvaluator
* Add get_test_samples method in dataset processors
* Fix args.py for BERT
* Add support for Reuters, IMDB datasets for BERT
* Revert "Integrate BERT into Castor framework (#17)" This reverts commit e4244ec.
* Fix paths to datasets in dataset classes and args
* Add SST dataset
* Add hedwig-data instructions to README.md
* Fix KimCNN README
* Fix RegLSTM README
* Fix typos in README
* Remove trec_eval from README
* Add tensorboardX to requirements.txt
* Rename processors module to bert_processors
* Add method to print metrics after training
* Add model check-pointing and early stopping for BERT
* Add logos
* Update README.md
* Fix code comments in classification trainer
* Add support for AAPD, Sogou, AGNews and Yelp2014
* Fix bug that deleted saved models
* Update README for HAN
* Update README for XML-CNN
* Remove redundant TODOs from the READMEs
* Fix logo in README.md
* Update README for Char-CNN
* Fix all the READMEs
* Resolve conflict
* Fix Typos
* Re-Add SST2 Processor
* Add support for evaluating trained model
* Update args.py
* Resolve issues due to DataParallel wrapper on saved model
* Remove redundant Yelp processor
* Fix bug for safely creating the saving directory
* Change checkpoint paths to timestamps
* Remove unwanted string.strip() from tokenizer
* Create save path if it doesn't exist
* Decouple model checkpoints from code
* Remove model choice restrictions for BERT
* Remove model/distill driver
* Simplify checkpoint directory creation
* Add TREC relevance datasets
* Add relevance transfer trainer and evaluator
* Add re-ranking module
* Add ImbalancedDatasetSampler
* Add relevance transfer package
* Fix import in classification trainer
* Remove unwanted args from models/bert
* Fix bug where model wasn't in training mode every epoch
* Add Robust45 preprocessor for BERT
* Add support for BERT for relevance transfer
* Add hierarchical BERT model
* Remove tensorboardX logging
* Add hierarchical BERT for relevance transfer
* Add learning rate multiplier
* Add lr multiplier for relevance transfer
* Add MLP model
* Add fastText model
* Add Reuters bag-of-words dataset class
* Add input dropout for MLP
* Remove duplicate README files
* Remove model caching mechanism for bert and hbert

Fixes issue #9

j-cahill and others added 30 commits July 19, 2019 21:53

Update setup.py package list

ff04385

Fixed paths to include 'models/'

Update README.md

a4e8311

modified training code to create learning curve figures

b4778e9

bug fix - learning curves

5531f0d

Merge pull request #1 from j-cahill/learning_curves

6402478

Learning curves

Update requirements.txt

2c2beff

fixed cuda bug for HAN

3ef89c3

attempt to make CUDA usage for HAN resemble BERT code

c2b1fcf

attempt to make CUDA usage for HAN resemble BERT code -2

454c5f3

CUDA fix for LSTM

9825e79

Merge pull request #2 from j-cahill/cuda_fix

43d0b98

Cuda fix

Add files via upload

4ee01c7

Merge pull request #3 from j-cahill/naotominakawa-patch-1

15c6f65

Add lyrics.py and lyrics_processor.py

modified args and dataset map to include lyrics arguments

cf8e101

Changed class number to correct number, 10

b55cdc9

Merge pull request #4 from j-cahill/classnum_bug

e402455

Changed class number to correct number, 10

Update lyrics_processor.py

e02c98f

Update lyrics_processor.py

edfc7cd

Update __main__.py

e13fe20

try at fixing cuda usage

fixed cuda loading for all models

d0ac4b0

Fix for weight_drop error

c188fb5

added local-rank arg

2b5b609

fixed local rank to only be in models/args

ccd8d8a

attempt at char_cnn file not found fix

11e6d00

char-cnn fix

2f34d5c

char_cnn fix

eb51e2b

char_cnn fix

42c596b

char_cnn fix

b8215c7

LSTM fix

aac329a

LSTM fix

ad2b8c9

j-cahill added 19 commits August 1, 2019 23:53

LSTM fix

9dba709

added Lyrics arg

fd8276f

added lyrics dataset to all __main__ files

51cd6a2

added lyrics to evaluators

ed7882a

added lyrics to train

8221788

Merge pull request #5 from j-cahill/cuda_fix

dba4907

Cuda fix

Update args.py

587f705

removed local rank argument

fix for num_classes for bert

d41615f

removed local-rank argument from bert.args

bdaec31

monitoring class number and multilabel

ef4d618

test

8ee8bb2

fixed sst_processor code

b3837c6

multilabel true for testing

3b53a5b

fixed evaluation metrics for 2 class problem

46f60d8

changed pos_label to 1

37e7ad2

removed testing print statements

1ce9b95

Merge branch 'master' into num_classes

52f7bf2

Merge pull request #6 from j-cahill/num_classes

4a0faff

Num classes

infer class number from actual dataset

8883909

j-cahill closed this Aug 4, 2019

j-cahill deleted the smart_classnum branch August 4, 2019 20:28

j-cahill restored the smart_classnum branch August 4, 2019 20:30

j-cahill deleted the smart_classnum branch August 4, 2019 20:31
