You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: applications/popart/bert/README.md
+13-23
Original file line number
Diff line number
Diff line change
@@ -39,8 +39,8 @@ Instructions on how to download the Wikipedia and SQuAD datasets can be found in
39
39
40
40
If full pre-training is required (with the two phases with different sequence lengths) then data will need to be generated separately for the two phases:
41
41
42
-
-once with --sequence-length 128 and --mask-tokens 20
43
-
- once with --sequence-length 384 --mask-tokens 60
42
+
- once with --sequence-length 128 --mask-tokens 20 --duplication-factor 6
43
+
- once with --sequence-length 384 --mask-tokens 56 --duplication-factor 6
44
44
45
45
See the `bert_data/README.md file` for more details on how to generate this data.
##### View the pre-training results in Tensorboard
170
169
171
170
`requirements.txt` will install a standalone version of tensorboard. The program will log all training runs to `--log-dir`(`logs` by default). View them by running:
@@ -176,22 +175,13 @@ tensorboard --logdir logs
176
175
177
176
### Run the training loop for pre-training (Wikipedia)
0 commit comments