Skip to content

Commit

Permalink
apply dynamic loss scaling (#161)
Browse files Browse the repository at this point in the history
  • Loading branch information
conglongli authored Feb 9, 2022
1 parent 41f4fdf commit 068e656
Show file tree
Hide file tree
Showing 3 changed files with 3 additions and 3 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@
"gradient_clipping": 1.0,
"fp16": {
"enabled": true,
"loss_scale": 1024,
"loss_scale": 0,
"loss_scale_window": 1000,
"hysteresis": 2,
"min_loss_scale": 1
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@
"gradient_clipping": 1.0,
"fp16": {
"enabled": true,
"loss_scale": 1024,
"loss_scale": 0,
"loss_scale_window": 1000,
"hysteresis": 2,
"min_loss_scale": 1
Expand Down
2 changes: 1 addition & 1 deletion Megatron-LM/scripts/ds_zero-offload_10B_config.json
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@
"gradient_clipping": 1.0,
"fp16": {
"enabled": true,
"loss_scale": 4096,
"loss_scale": 0,
"loss_scale_window": 1000,
"hysteresis": 2,
"min_loss_scale": 1
Expand Down

0 comments on commit 068e656

Please sign in to comment.