llama.cpp
99ed03a2 - metal : improve decoding speed for batches of 2-16

Commit

2 years ago

metal : improve decoding speed for batches of 2-16

References

#3524 - metal : support MTLGPUFamily < Apple7, formatting, style

Author

ggerganov
