llama.cpp
02115dcd - vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326)

Commit
97 days ago
vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326)

This assert fired running Qwen_Qwen3-30B-A3B-Q2_K.gguf:

    GGML_ASSERT(nei0 * nei1 <= 3072);

The tensor is 8 x 512. Increase this array size to accommodate.
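The arithmetic behind the failure can be sketched as follows. This is a minimal illustration, not the code from ggml-vulkan.cpp: the helper `row_ids_fit` and the two constant names are hypothetical, and only the values 3072 and 4096 and the 8 x 512 shape are taken from the commit message.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical names for illustration only. The values come from the
// commit message: the old assert bound (3072) and the new bound (4096).
constexpr uint32_t kOldRowIdsLimit = 3072;
constexpr uint32_t kNewRowIdsLimit = 4096;

// Returns true when an ids tensor of shape nei0 x nei1 fits within
// `limit` entries. The product is computed in 64 bits to avoid overflow.
constexpr bool row_ids_fit(uint32_t nei0, uint32_t nei1, uint32_t limit) {
    return static_cast<uint64_t>(nei0) * nei1 <= limit;
}

// 8 * 512 = 4096, which exceeds the old bound but fits the new one.
static_assert(!row_ids_fit(8, 512, kOldRowIdsLimit), "old bound rejects 8 x 512");
static_assert(row_ids_fit(8, 512, kNewRowIdsLimit), "new bound accepts 8 x 512");
```

With the 8 x 512 tensor reported above, the product is exactly 4096, so the new bound is the smallest value that accommodates this model.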
  • ggml/src/ggml-vulkan
    • ggml-vulkan.cpp
    • vulkan-shaders
      • mul_mm.comp
      • mul_mm_cm2.comp
      • mul_mmq.comp