Add GQA on CPU in LLaMA scripts #20720
Add changes to support GQA for CPU
fc92f7a8
Enable trusting remote code
10750bc2
Set pad token id to EOS token id
4e0738b9
tianleiwu
dismissed these changes
on 2024-05-18
Add GQA for FP32 CPU during optimization
115b8cad
Enable trusting remote code via authentication
8051ef76
Add changes suggested by linter
f0ba1f01
kunal-vaishnavi
changed the title from "Add support for GQA on CPU in LLaMA benchmarking scripts" to "Add GQA on CPU in LLaMA scripts" 1 year ago
tianleiwu
approved these changes
on 2024-05-18