text-generation-inference
5cd8025f
- hotfix: fix regression of attention api change in intel platform (#2439)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
hotfix: fix regression of attention api change in intel platform (#2439) fix regression caused by attention api change. ipex.varlen_attention does not support paged-cache format kv input now. Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
References
#2439 - hotfix: fix regression of attention api change in intel platform
Author
sywangyi
Parents
e279b38a
Loading