llama.cpp
a3738b2f - vulkan : implement Stable Diffusion operators (ggml/904)

Commit
323 days ago
vulkan : implement Stable Diffusion operators (ggml/904) * Fix Vulkan repeat op * Implement Vulkan concat op * Delete old Vulkan shader generator * Implement Vulkan im2col op * Implement Vulkan unary gelu_quick op * Implement Vulkan group_norm op * Implement Vulkan timestep_embedding op * Implement Vulkan upscale op * Fix Vulkan vk_context tensor extra index issue * Fix Vulkan matmul shader parameter bug * Properly fix Vulkan matmul shader parameter bug * Add Vulkan ADD f16 + f32 -> f16 operator support * Implement Vulkan tanh op * Fix Vulkan group count too large Validation error on non-Nvidia GPUs * Throw error when too much memory is requested * Fix another Vulkan group count too large Validation error on non-Nvidia GPUs * Fix matmul MMQ condition * Implement Vulkan pad op * Fix Vulkan crash when tensor is used multiple times in a compute graph * Add Vulkan CONCAT f16 + f16 -> f16 op * Add Vulkan LEAKY_RELU op
Author
Committer
Parents
  • ggml/src
    • File
      ggml-vulkan.cpp
    • vulkan-shaders
      • File
        add.comp
      • File
        clamp.comp
      • File
        concat.comp
      • File
        copy.comp
      • File
        div.comp
      • File
        gelu.comp
      • File
        gelu_quick.comp
      • File
        generic_binary_head.comp
      • File
        generic_unary_head.comp
      • File
        group_norm.comp
      • File
        im2col.comp
      • File
        leaky_relu.comp
      • File
        mul.comp
      • File
        norm.comp
      • File
        pad.comp
      • File
        relu.comp
      • File
        rms_norm.comp
      • File
        scale.comp
      • File
        silu.comp
      • File
        soft_max.comp
      • File
        square.comp
      • File
        sum_rows.comp
      • File
        tanh.comp
      • File
        timestep_embedding.comp
      • File
        types.comp
      • File
        upscale.comp
      • File
        vulkan-shaders-gen.cpp