llama.cpp
llama-model : support Qwen2 embedding models and pooling_mode_lasttoken
#13245
Merged

Commits
  • llama-model : support Qwen2 embedding models and pooling_mode_lasttoken
    cebtenzzre committed 239 days ago
  • set hf_arch in TextModel.__init__
    cebtenzzre committed 239 days ago
  • appease pyright
    cebtenzzre committed 239 days ago
  • add explicit type annotation
    cebtenzzre committed 239 days ago
Loading