enable ptq/qat quantization
b24f6021
update dynamic shape
79850abb
update
264018e7
enable hf ptq
a6acd0d8
refine
fad184cf
Init XPU backend for PT2E
8698a4fb
Enable XPUInductorQuantizer and fix SDPA attn_mask device
2a966b24
Disable Inductor Cache on XPU
52d0802b
Resolve hard code for device in utils
2f21dad4
Refine code style
599d83f1
Move all model buffers to XPU
39dd3845
Replace deprecated API capture_pre_autograd_graph with export_for_tra…
5f8857aa
Set freezing, add gpu synchronize
0ac33f6f
Enable quant config switching
2a935fdc