llama.cpp
CUDA: fix MMQ writeback for int8 tensor cores #8100
Merged

JohannesGaessler — CUDA: fix MMQ writeback for int8 tensor cores (commit 0402d4f8)
github-actions added labels: Nvidia GPU, ggml
slaren approved these changes on 2024-06-24
JohannesGaessler merged 3b099bcd into master 1 year ago
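For context on what an MMQ (quantized matrix multiplication) writeback involves: int8 tensor-core matrix multiplies accumulate dot products into int32 registers, and on writeback each accumulator is scaled back to float and stored into the output, skipping tile lanes that fall outside the logical matrix bounds. The sketch below is a generic host-side illustration of that pattern, not the PR's actual CUDA kernel code; the function name, tile layout, and per-block scales are assumptions for the example.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Illustrative MMQ-style writeback (not the code from PR #8100):
// a tile of int32 accumulators from int8 multiplies is scaled by
// hypothetical per-block quantization scales and written into the
// float output, with bounds checks so tiles overhanging the edge
// of the matrix do not write out of range.
void mmq_writeback(const std::vector<int32_t> & acc,  // tile_m x tile_n accumulators
                   float scale_a, float scale_b,      // assumed per-block scales
                   std::vector<float> & dst,          // m x n output, row-major
                   int m, int n,                      // logical output size
                   int row0, int col0,                // tile origin in dst
                   int tile_m, int tile_n) {
    for (int i = 0; i < tile_m; ++i) {
        for (int j = 0; j < tile_n; ++j) {
            const int row = row0 + i;
            const int col = col0 + j;
            if (row >= m || col >= n) {
                continue; // skip out-of-bounds lanes of the tile
            }
            dst[row*n + col] = scale_a*scale_b*float(acc[i*tile_n + j]);
        }
    }
}
```

In the real MMQ kernels this loop runs per warp with the accumulators living in tensor-core fragments, but the scaling and the bounds check are the same idea.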
