llama.cpp
3edd87cd
- opencl: optimize mxfp4 kernels (#16037)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 day ago
opencl: optimize mxfp4 kernels (#16037) - flatten mxfp4 and packed fp4->fp16 bit-wise convert function (replace lut) - MoE kernel optimizations --------- Co-authored-by: Li He <lih@qti.qualcomm.com>
References
#16037 - OpenCL: MoE MXFP4 kernel optimizations
Author
shawngu-quic
Parents
c0b45097
Loading