transformers
GGUF: optional Metal dequant fast path via kernels-community
#45975
Open

GGUF: optional Metal dequant fast path via kernels-community #45975

ArthurZucker wants to merge 3 commits into update-gguf from gguf-metal-kernels
ArthurZucker
ArthurZucker GGUF: optional Metal dequant fast path via kernels-community/gguf-deq…
be73645e
ArthurZucker serve: auto-select metal-flash-sdpa attention on MPS
9fc9687b
ArthurZucker Point Metal kernel repo to ArthurZ/gguf-dequant until transfer to ker…
b4fb5d71
ArthurZucker
HuggingFaceDocBuilderDev
ArthurZucker

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone