[BUGFIX] fix NTA implementation #1277

congxie1108 · 2020-07-27T20:45:29Z

Description

Resolve the issue #1253
Currently, the average trigger checks “val_L > min(valid_losses[-n:])”.
In this patch, I change the implementation into the version published in ICLR, [1], the algorithms is actually “val_L > min(valid_losses[:-n])”, which is also used in Salesforce’s source code: [2]

References

[1] https://openreview.net/pdf?id=SyyGPP0TZ
[2] https://github.com/salesforce/awd-lstm-lm/blob/32fcb42562aeb5c7e6c9dec3f2a3baaaf68a5cb5/main.py#L275

cc @dmlc/gluon-nlp-team

mli · 2020-07-27T21:22:29Z

Job PR-1277/1 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1277/1/index.html

codecov · 2020-07-27T22:36:19Z

Codecov Report

Merging #1277 into master will increase coverage by 0.27%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #1277      +/-   ##
==========================================
+ Coverage   87.45%   87.72%   +0.27%     
==========================================
  Files          81       81              
  Lines        7365     7365              
==========================================
+ Hits         6441     6461      +20     
+ Misses        924      904      -20

Impacted Files	Coverage Δ
src/gluonnlp/data/word_embedding_evaluation.py	`96.93% <0.00%> (+7.66%)`	⬆️

congxie1108 · 2020-07-31T22:49:12Z

@sxjscience @szha I've updated the results reported in the webpage in this pr.
Although the results of language models look fine to me, the cached language models seem to have a significant performance regression.
The updated logs could be found in dmlc/web-data#259

[BUGFIX] fix NTA implementation (dmlc#1277)

fix NTA

1c69716

congxie1108 requested a review from a team as a code owner July 27, 2020 20:45

congxie1108 mentioned this pull request Jul 27, 2020

Implementation of Non-monotonically Triggered AvSGD mismatches the ICLR paper #1253

Open

sxjscience approved these changes Jul 28, 2020

View reviewed changes

congxie1108 added 2 commits July 31, 2020 17:42

update language model webpage

6c0bdb6

update language model webpage

f18efce

congxie1108 mentioned this pull request Jul 31, 2020

update the logs of language models dmlc/web-data#259

Merged

congxie1108 added 2 commits August 3, 2020 20:22

trigger ci test again

73ffb60

trigger ci test again

d8379f8

szha merged commit 528283d into dmlc:master Aug 6, 2020

jamiekang added a commit to jamiekang/gluon-nlp that referenced this pull request Aug 11, 2020

Merge pull request #5 from dmlc/master

55a1d46

[BUGFIX] fix NTA implementation (dmlc#1277)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUGFIX] fix NTA implementation #1277

[BUGFIX] fix NTA implementation #1277

congxie1108 commented Jul 27, 2020

mli commented Jul 27, 2020

codecov bot commented Jul 27, 2020 •

edited

Loading

congxie1108 commented Jul 31, 2020

[BUGFIX] fix NTA implementation #1277

[BUGFIX] fix NTA implementation #1277

Conversation

congxie1108 commented Jul 27, 2020

Description

References

mli commented Jul 27, 2020

codecov bot commented Jul 27, 2020 • edited Loading

Codecov Report

congxie1108 commented Jul 31, 2020

codecov bot commented Jul 27, 2020 •

edited

Loading