peterchen-intel changed the title from "Extend moe_3gemm to all Ultra series iGPU" to "Extend moe_3gemm to all GPUs and reduce internal buffer holding in paged_attention_opt" 42 days ago
peterchen-intel changed the title from "Extend moe_3gemm to all GPUs and reduce internal buffer holding in paged_attention_opt" to "Extend moe_3gemm to all Intel GPUs" 12 days ago