llama.cpp
58d07a80 - metal : copy kernels for quant to F32/F16 conversions (#12017)

Commit

1 year ago

metal : copy kernels for quant to F32/F16 conversions (#12017) metal: use dequantize_q templates --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

References

#12017 - metal: Copy kernels for quant to F32 conversions (#10976).

Author

gcp

Parents

34a846b5

llama.cpp 58d07a80 - metal : copy kernels for quant to F32/F16 conversions (#12017)

llama.cpp
58d07a80 - metal : copy kernels for quant to F32/F16 conversions (#12017)