llama.cpp
970b5ab7 - ggml-cuda : add TQ2_0 support
Commit
1 year ago
ggml-cuda : add TQ2_0 support
References
#11183 - ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU
Author
compilade
Parents
5cd85b5e