transformers
aef12349 - Make HF implementation match original OLMo 2 models for lower precisions (#38131)

Commit
308 days ago
Make HF implementation match original OLMo 2 models for lower precisions (#38131) * Make HF implementation match OLMo models for lower precisions * Add test of 1B logits in bfloat16 * Run make fixup
Author
Parents
Loading