transformers
[`StableLm`] Add QK normalization and Parallel Residual Support
#29745
Merged

Loading