transformers
2230d149
- fix get_keys_to_not_convert() to return correct modules for full precision inference (#25105)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
fix get_keys_to_not_convert() to return correct modules for full precision inference (#25105) * add test for `get_keys_to_not_convert` * add minimum patch to keep mpt lm_head from 8bit quantization * add reivsion to
References
#25105 - fix get_keys_to_not_convert() to return correct modules for full precision inference
Author
ranchlai
Parents
f6f567d0
Loading