Skip to content
This repository has been archived by the owner on Jan 15, 2024. It is now read-only.

Commit

Permalink
fix the hyperparameters for Electra-Large on squad (#1396)
Browse files Browse the repository at this point in the history
* fix the hyperparameters for squad

* typo

* update
  • Loading branch information
ZiyueHuang authored Oct 22, 2020
1 parent 149270a commit 46c9d01
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 3 deletions.
3 changes: 2 additions & 1 deletion scripts/question_answering/commands/generate_commands.py
Original file line number Diff line number Diff line change
Expand Up @@ -61,7 +61,8 @@ def electra_large_cfg():
cfg.model_name = 'google_electra_large'
cfg.batch_size = 2
cfg.num_accumulated = 4
cfg.lr = 1e-5
cfg.lr = 5e-5
cfg.max_grad_norm = 1
cfg.layerwise_decay = 0.9
return cfg

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -6,11 +6,11 @@ MODEL_NAME=google_electra_large
BATCH_SIZE=2
NUM_ACCUMULATED=4
EPOCHS=2
LR=1e-05
LR=5e-05
WARMUP_RATIO=0.1
WD=0
MAX_SEQ_LENGTH=512
MAX_GRAD_NORM=0.1
MAX_GRAD_NORM=1
LAYERWISE_DECAY=0.9

# Prepare the Data
Expand Down

0 comments on commit 46c9d01

Please sign in to comment.