vllm
e22ee1e7 - [Kernel] GGUF MoE kernel (#14613)

Commit
106 days ago
[Kernel] GGUF MoE kernel (#14613)

Signed-off-by: SzymonOzog <szymon.ozog@aleph-alpha.com>
  • csrc
    • ops.h
    • quantization/gguf
      • gguf_kernel.cu
      • moe.cuh
    • torch_bindings.cpp
  • tests/kernels
    • test_ggml.py
    • test_gguf.py
  • vllm
    • _custom_ops.py
    • model_executor/layers/quantization
      • gguf.py