llm-foundry
86a99e2a
- Use torch.repeat instead of expand on key & value in Triton MQA to prevent NaNs with certain h_dims (#442)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Use torch.repeat instead of expand on key & value in Triton MQA to prevent NaNs with certain h_dims (#442)
References
#442 - Use torch.repeat instead of expand on key & value in Triton MQA to prevent NaNs with certain h_dims
Author
sashaDoubov
Parents
62e2feac
Loading