Add CUDA kernel support for GGUF inference #11869
add gguf kernel support
6c4d01de
fix
66bd237b
optimize
e46571a7
DN6 commented on 2025-07-07
update
de1fb4b6
update
db94e2b5
update
cb004ad5
DN6 marked this pull request as ready for review 229 days ago
update
5c4eee56
Merge branch 'main' into gguf-kernel
d10d1611
update
98754e28
DN6 approved these changes on 2025-08-05
DN6 merged ba2ba901 into main 217 days ago
DN6 added roadmap
Isotr0py deleted the gguf-kernel branch 216 days ago