llama.cpp
Fix Vulkan Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64
#8855
Merged

Fix Vulkan Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64 #8855

ggerganov merged 2 commits into master from 0cc4m/vulkan-fix-mmv-tests
0cc4m
0cc4m Fix Vulkan mul mat vec invalid results when ncols < warp size
ecabd54d
0cc4m Only run backend ops mul mat vec block size test if block size not al…
6c75cb95
github-actions github-actions added testing
0cc4m 0cc4m changed the title 0cc4m/vulkan fix mmv tests Fix Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64 1 year ago
JohannesGaessler JohannesGaessler added Vulkan
0cc4m 0cc4m changed the title Fix Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64 Fix Vulkan Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64 1 year ago
ggerganov
ggerganov approved these changes on 2024-08-05
ggerganov ggerganov merged 064cdc26 into master 1 year ago
0cc4m 0cc4m deleted the 0cc4m/vulkan-fix-mmv-tests branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone