llama.cpp
Commit 6b86bcff
cuda : increase max block size to 1024
References
#4256 - ggml : add ggml_soft_max_ext
Author
ggerganov
Parents
62532c05
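
This commit raises the cap on the CUDA block size (threads per block) to 1024, the hardware limit on current NVIDIA GPUs, which lets a single block cover wider rows in kernels such as the soft-max path added by ggml_soft_max_ext (#4256). The sketch below is not the actual llama.cpp/ggml kernel; the kernel, constant name MAX_BLOCK_SIZE, and launch logic are hypothetical and only illustrate why a 1024-thread cap matters for a one-block-per-row soft-max.

```cuda
// Minimal sketch (assumed names, not the ggml kernel): one block per row,
// blockDim.x threads cooperate on a row-wise soft-max via shared memory.
#include <cstdio>
#include <cmath>
#include <cuda_runtime.h>

constexpr int MAX_BLOCK_SIZE = 1024; // hypothetical constant; cap this commit raises

__global__ void soft_max_rows(const float *x, float *y, int ncols) {
    extern __shared__ float buf[];          // one float per thread
    const int row = blockIdx.x;
    const int tid = threadIdx.x;
    const float *xr = x + (size_t)row * ncols;
    float       *yr = y + (size_t)row * ncols;

    // 1) per-thread max over a strided slice of the row, then block reduction
    float vmax = -INFINITY;
    for (int i = tid; i < ncols; i += blockDim.x) vmax = fmaxf(vmax, xr[i]);
    buf[tid] = vmax;
    __syncthreads();
    for (int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (tid < s) buf[tid] = fmaxf(buf[tid], buf[tid + s]);
        __syncthreads();
    }
    vmax = buf[0];
    __syncthreads();

    // 2) exponentiate, accumulate per-thread sums, reduce the sum
    float vsum = 0.0f;
    for (int i = tid; i < ncols; i += blockDim.x) {
        const float e = expf(xr[i] - vmax);
        yr[i] = e;
        vsum += e;
    }
    buf[tid] = vsum;
    __syncthreads();
    for (int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (tid < s) buf[tid] += buf[tid + s];
        __syncthreads();
    }
    const float inv_sum = 1.0f / buf[0];
    __syncthreads();

    // 3) normalize
    for (int i = tid; i < ncols; i += blockDim.x) yr[i] *= inv_sum;
}

int main() {
    const int nrows = 4, ncols = 2048;

    // Pick as many threads as columns, rounded up to a power of two,
    // capped at MAX_BLOCK_SIZE (1024 instead of a smaller previous cap).
    int block = 1;
    while (block < ncols && block < MAX_BLOCK_SIZE) block <<= 1;

    float *x, *y;
    cudaMallocManaged(&x, (size_t)nrows * ncols * sizeof(float));
    cudaMallocManaged(&y, (size_t)nrows * ncols * sizeof(float));
    for (int i = 0; i < nrows * ncols; ++i) x[i] = (float)(i % 7);

    soft_max_rows<<<nrows, block, block * sizeof(float)>>>(x, y, ncols);
    cudaDeviceSynchronize();

    double s = 0.0;                          // each row should sum to ~1.0
    for (int i = 0; i < ncols; ++i) s += y[i];
    printf("row 0 sums to %f\n", s);

    cudaFree(x); cudaFree(y);
    return 0;
}
```

With the cap at 1024, rows up to 1024 elements wide can be handled with one element per thread before the strided loop kicks in; with a smaller cap, wider rows would need more loop iterations per thread for the same grid.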