-
Notifications
You must be signed in to change notification settings - Fork 530
[MODEL] BERT conversion scripts, SciBERT, BioBERT, ClinicalBERT #735
Conversation
Codecov Report
|
Codecov Report
@@ Coverage Diff @@
## master #735 +/- ##
==========================================
+ Coverage 89.95% 90.61% +0.65%
==========================================
Files 64 64
Lines 6064 6064
==========================================
+ Hits 5455 5495 +40
+ Misses 609 569 -40
|
@eric-haibin-lin this PR only touches code in the script folder. All other changes are due to #732 and can be ignored while reviewing this. |
2c7a7ba
to
501189c
Compare
Job PR-735/4 is complete. |
501189c
to
d3a9c68
Compare
Job PR-735/5 is complete. |
Typically Optimizer parameters are not exported for Bert models (eg. https://github.com/google-research/bert). As this does not always hold, we have to handle the case that the parameters are still present (eg. https://github.com/allenai/scibert).
d3a9c68
to
20950a7
Compare
20950a7
to
c32f538
Compare
Job PR-735/7 is complete. |
Job PR-735/8 is complete. |
Confirmed that the scibert_scivocab_uncased loaded from PyTorch produces the same output as the tensorflow version (based on the compare_tf_gluon_model.py).
d229a41
to
5614e5b
Compare
Job PR-735/14 is complete. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice work!
scripts/bert/conversion_tools/infer_pytorch_gluon_parameter_name_mapping.py
Outdated
Show resolved
Hide resolved
Job PR-735/15 is complete. |
Job PR-735/16 is complete. |
Job PR-735/17 is complete. |
This refactors the tf Bert conversion scripts based on #732.
@eric-haibin-lin