llm-foundry
86a99e2a - Use torch.repeat instead of expand on key & value in Triton MQA to prevent NaNs with certain h_dims (#442)

Commit
2 years ago
Use torch.repeat instead of expand on key & value in Triton MQA to prevent NaNs with certain h_dims (#442)
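
The difference between the two calls named in the commit title can be sketched as follows. This is a minimal illustration, not the code changed in the commit: the tensor shapes and variable names are hypothetical, and the commit title does not state why `expand` led to NaNs. What the sketch does show is that `expand` returns a view whose broadcast dimension has stride 0, while `repeat` materializes a separate contiguous copy per head.

```python
import torch

# Hypothetical MQA shapes, chosen only for illustration.
batch, n_heads, seq_len, head_dim = 2, 4, 8, 16

# In multi-query attention, all query heads share a single key/value head.
key = torch.randn(batch, 1, seq_len, head_dim)

# expand: returns a view over the same storage; the head dimension
# gets stride 0 and no data is copied.
key_expanded = key.expand(batch, n_heads, seq_len, head_dim)

# repeat: allocates new memory and copies the data once per head,
# so the result has ordinary contiguous strides.
key_repeated = key.repeat(1, n_heads, 1, 1)

# Both tensors hold the same values; only the memory layout differs.
assert torch.equal(key_expanded, key_repeated)

print(key_expanded.stride(), key_expanded.is_contiguous())
print(key_repeated.stride(), key_repeated.is_contiguous())
```

The trade-off is memory: `repeat` costs `n_heads` times the storage of the shared key or value tensor, in exchange for a layout that does not rely on stride-0 broadcasting.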