llama.cpp
a1327c71 - parallel : rename hot-plug to continuous-batching

Commit

2 years ago

parallel : rename hot-plug to continuous-batching

References

#3228 - llama : custom attention mask + parallel decoding + no context swaps

Author

ggerganov

Committer

ggerganov
