llama.cpp
[CANN] Support Q4_0 for Ascend NPU
#8822
Merged

[CANN] Support Q4_0 for Ascend NPU #8822

hipudding merged 1 commit into ggml-org:master from wangshuai09:q8_0
wangshuai09
github-actions github-actions added testing
github-actions github-actions added ggml
wangshuai09 wangshuai09 force pushed 1 year ago
wangshuai09 wangshuai09 changed the title [CANN] Add CPY for Q4_0 [CANN] Support Q4_0 for Ascend NPU 1 year ago
hipudding
hipudding commented on 2024-08-05
hipudding hipudding requested a review from hipudding hipudding 1 year ago
hipudding hipudding added Ascend NPU
hipudding
hipudding commented on 2024-08-05
wangshuai09 cann: support q4_0 model
514678a2
wangshuai09 wangshuai09 force pushed to 514678a2 1 year ago
wangshuai09 wangshuai09 marked this pull request as ready for review 1 year ago
hipudding hipudding requested a review from hipudding hipudding 1 year ago
hipudding
hipudding approved these changes on 2024-08-05
hipudding hipudding merged c02b0a8a into master 1 year ago
fan-chao
wangshuai09
fan-chao
hipudding

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone