llama.cpp
7c7836d9 - Vulkan Shader Refactor, Memory Debugging Option (#7947)

Commit
357 days ago
Vulkan Shader Refactor, Memory Debugging Option (#7947) * Refactor shaders, extract GLSL code from ggml_vk_generate_shaders.py into vulkan-shaders directory * Improve debug log code * Add memory debug output option * Fix flake8 * Fix unnecessary high llama-3 VRAM use
Author
Parents
  • File
    CMakeLists.txt
  • File
    Makefile
  • File
    ggml-vulkan-shaders.hpp
  • File
    ggml-vulkan.cpp
  • File
    ggml_vk_generate_shaders.py
  • vulkan-shaders
    • File
      add.comp
    • File
      argsort.comp
    • File
      clamp.comp
    • File
      copy.comp
    • File
      dequant_f32.comp
    • File
      dequant_funcs.comp
    • File
      dequant_head.comp
    • File
      dequant_q2_k.comp
    • File
      dequant_q3_k.comp
    • File
      dequant_q4_0.comp
    • File
      dequant_q4_1.comp
    • File
      dequant_q4_k.comp
    • File
      dequant_q5_0.comp
    • File
      dequant_q5_1.comp
    • File
      dequant_q5_k.comp
    • File
      dequant_q6_k.comp
    • File
      dequant_q8_0.comp
    • File
      diag_mask_inf.comp
    • File
      div.comp
    • File
      gelu.comp
    • File
      generic_binary_head.comp
    • File
      generic_head.comp
    • File
      generic_unary_head.comp
    • File
      get_rows.comp
    • File
      get_rows_quant.comp
    • File
      mul.comp
    • File
      mul_mat_split_k_reduce.comp
    • File
      mul_mat_vec.comp
    • File
      mul_mat_vec_base.comp
    • File
      mul_mat_vec_nc.comp
    • File
      mul_mat_vec_p021.comp
    • File
      mul_mat_vec_q2_k.comp
    • File
      mul_mat_vec_q3_k.comp
    • File
      mul_mat_vec_q4_k.comp
    • File
      mul_mat_vec_q5_k.comp
    • File
      mul_mat_vec_q6_k.comp
    • File
      mul_mm.comp
    • File
      norm.comp
    • File
      relu.comp
    • File
      rms_norm.comp
    • File
      rope_head.comp
    • File
      rope_neox.comp
    • File
      rope_norm.comp
    • File
      scale.comp
    • File
      silu.comp
    • File
      soft_max.comp
    • File
      square.comp
    • File
      sum_rows.comp
    • File
      types.comp