Torchao float8 training #3348
vkuzo
commented
on 2025-01-16
vkuzo
commented
on 2025-01-16
vkuzo
commented
on 2025-01-16
muellerzr
changed the title Torchao float8 training [WIP/MVP] Torchao float8 training 364 days ago
Bookmark
febc6344
bookmark
a7663c51
Add torchao base example
ed1adb1d
Currently broken
3d34f8ec
Clean
a032f1a5
DDP variant working
71979436
FSDP as well
dc797fde
Works for all but zero3
145dec2c
Bookmark: currently zero3 is underperforming
676d5ac1
Bookmark
92b3d9bb
muellerzr
force pushed
from
f8058764
to
92b3d9bb
340 days ago
Another diff
660b6b59
Fin
ca45f463
Fin
a0193ce2
Add req huggingface suite
b271b139
update tests for fp8/torchao/ddp
a5d2c29f
Log FP8 backend used and adjust typing
002c4be2
add documentation for convert_to_float8_training
95fbb8dc
Rename to convert_model_to_fp8_ao
ac222966
Call super init
06642ca1
Add types
d46b0a1d
Clean
62881acb
SunMarc
approved these changes
on 2025-02-12
Use filter_first_and_last_linear_layers
f3ceb37b
Update usage guide docs
14f6d04e
Actually loop through the zero stages
a8b6b8c6
Clean
f222d721
SunMarc
approved these changes
on 2025-02-17
muellerzr
merged
8039158d
into main 333 days ago
muellerzr
deleted the torchao-float8 branch 333 days ago
Assignees
No one assigned