Megatron-DeepSpeed
add `pad-vocab-size-to` argument and tests
#255
Merged


SaulLu merged 56 commits into main from LS/vocab_size
SaulLu init new test
2025ac24
SaulLu test pad vocab size to
c33343d2
SaulLu add logs
390b4dc1
SaulLu log to warning
784b7512
SaulLu change TP
6f3a4721
SaulLu fix loop
1d9649af
SaulLu revert
7fa5c103
SaulLu remove hack size
bcc6d8de
SaulLu this new test should pass
9e17a4f7
SaulLu test not divisible by num tp
92614bf7
SaulLu Revert "remove hack size"
8322f897
SaulLu Revert "Revert "remove hack size""
6d720734
SaulLu Revert "test not divisible by num tp"
84333d35
SaulLu Revert "this new test should pass"
b2382d84
SaulLu change info to warning
d4a15a3b
SaulLu change to print
cd5e8b4e
SaulLu add print
a6ee8947
SaulLu test 2
f534c43f
SaulLu new print
0a1167b0
SaulLu woups
34bfd60d
SaulLu more
50cb3ca5
SaulLu woups
786e02dc
SaulLu comment
20d08a85
SaulLu raise errors
915bd6c7
SaulLu woups
119a0d2d
SaulLu pad to save vocab size
5c6dec09
SaulLu simplify test
de3353fb
SaulLu assert test raised
8485770a
SaulLu print error msg
df244924
SaulLu check msg error
46fc9dac
SaulLu check error
9ffafb12
SaulLu woups
1eb5baa4
SaulLu clean
56af695d
SaulLu simplify
3ea0c6bc
SaulLu remove unused print
be2e371b
SaulLu add comment
89869625
SaulLu add test multiple of tp size
a72fa034
SaulLu add print
1e5b2af3
SaulLu add check
8d8be7ea
SaulLu changed the title from "[WIP] set the tokenizer vocab size" to "add `pad-vocab-size-to` argument and tests" 3 years ago
SaulLu requested a review from DanielHesslow 3 years ago
SaulLu requested a review from thomasw21 3 years ago
SaulLu requested a review from stas00 3 years ago
SaulLu commented on 2022-02-28
SaulLu clean
b2867a7b
stas00 commented on 2022-02-28
thomasw21 commented on 2022-02-28
stas00 commented on 2022-02-28
SaulLu Update megatron/mpu/layers.py
ef61e898
SaulLu Update megatron/tokenizer/tokenizer.py
c10a3598
SaulLu chnage micro-batch-size
fc975b44
SaulLu use tiny vocab
a2b86b74
SaulLu fix data dir
ae9f83c1
SaulLu fix arg
ecdda509
SaulLu change micro-batch-size
c170fd9d
SaulLu adept input ids
c82d6154
SaulLu assertIn
3587b52c
SaulLu change micro batch size
a90a8f99
DanielHesslow
SaulLu Fix test TP
982d88c2
SaulLu unused var
78b76861
SaulLu add test make_vocab_size_divisible_by
c9222042
SaulLu fix test_tokenizer_vocab_size_multiple_of_tp_size test
806cbb5f
thomasw21 Fix padded vocab size on preprocessing scripts (#257)
f515b67f
SaulLu documentation
02f86f57
thomasw21 approved these changes on 2022-03-01
SaulLu merged 58d92042 into main 3 years ago
SaulLu deleted the LS/vocab_size branch 3 years ago
