llama.cpp
4afb0a74 - server : Support multimodal completion and embeddings prompts in JSON format (#15108)

Commit
75 days ago
server : Support multimodal completion and embeddings prompts in JSON format (#15108)

- Use server_tokens in more places in server and util.cpp
- Convert most functions that used llama_tokens to server_tokens
- Modify the input tokenizer to handle JSON objects as subprompts
- Break out MTMD prompt parsing into a utility function
- Support JSON objects with multimodal_data arrays for MTMD prompts alongside the existing prompt types (see the sketch below)
- Add a capability flag to the model endpoint indicating whether the client can send multimodal data
- Add tests
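A minimal sketch of what a multimodal completion request might look like against a locally running llama-server with an MTMD-capable model. The commit message only confirms that JSON object subprompts with multimodal_data arrays are accepted; the exact field nesting, the base64 encoding of the image payload, the server URL, and the sample file name below are assumptions for illustration, not the confirmed API shape.

```python
# Hypothetical sketch: POST a JSON-object prompt with multimodal_data to a
# local llama-server. Field layout is assumed, not taken from the commit.
import base64
import json
import urllib.request

# Read an image and base64-encode it (assumed transport encoding).
with open("cat.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("ascii")

payload = {
    # Prompt given as a JSON object (subprompt) instead of a plain string,
    # carrying the text and an array of multimodal attachments.
    "prompt": {
        "prompt": "Describe the attached image.",
        "multimodal_data": [image_b64],
    },
    "n_predict": 64,
}

req = urllib.request.Request(
    "http://localhost:8080/completion",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    # Print the raw JSON response rather than assuming its field names.
    print(json.loads(resp.read()))
```

The same JSON-object prompt shape would presumably apply to the embeddings endpoint, since the commit covers both completion and embeddings prompts.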
Author
65a