Support larger hidden size in Attention Cuda kernel #7002
Support larger hidden size in Attention Cuda kernel
16470401
gh-yewang
marked this pull request as draft 5 years ago
gh-yewang
changed the title from "(WIP)Support larger hidden size in Attention Cuda kernel" to "Support larger hidden size in Attention Cuda kernel" 5 years ago
gh-yewang
changed the title from "Support larger hidden size in Attention Cuda kernel" to "(WIP)Support larger hidden size in Attention Cuda kernel" 5 years ago
Update attention_transpose.cu
165c5fcf
gh-yewang
changed the title from "(WIP)Support larger hidden size in Attention Cuda kernel" to "Support larger hidden size in Attention Cuda kernel" 5 years ago
gh-yewang
marked this pull request as ready for review 5 years ago
review comments
f2e84fcb
fix typo and add check in quantization
f0d8d96a
update readme
695f5b18
tianleiwu
approved these changes
on 2021-03-15
gh-yewang
merged
4e670f7a
into master 5 years ago
gh-yewang
deleted the wangye/hidden_size branch 5 years ago