llama.cpp
99b71c06 - Server: Use multi-task for embeddings endpoint (#6001)

Commit
1 year ago
Server: Use multi-task for embeddings endpoint (#6001) * use multitask for embd endpoint * specify types * remove redundant {"n_predict", 0}
Author
Parents
Loading