xla
f008ec88 - Allowing `float16` and `bfloat16` computation type and sharding groups in XLA FSDP (#3588)

Commit

3 years ago

Allowing `float16` and `bfloat16` computation type and sharding groups in XLA FSDP (#3588) * allow specifying computation dtype other than float32 in XLA FSDP; also allow groups for sharding and grad clipping * minor update and lint * minor fix on model `extra_repr`

References

#3588 - Allowing `float16` and `bfloat16` computation type and sharding groups in XLA FSDP

Author

ronghanghu

Parents

08084ea0

xla f008ec88 - Allowing `float16` and `bfloat16` computation type and sharding groups in XLA FSDP (#3588)

xla
f008ec88 - Allowing `float16` and `bfloat16` computation type and sharding groups in XLA FSDP (#3588)