transformers
Efficient Inference Kernel for SpQR
#34976
Merged

MekkCyber merged 44 commits into huggingface:main from elvircrn:spqr-quantizer
elvircrn
elvircrn marked this pull request as draft 1 year ago
elvircrn changed the title from "Spqr quantizer" to "Efficient Inference Kernel for SpQR" 1 year ago
SunMarc requested a review from MekkCyber 1 year ago
MekkCyber
MekkCyber
MekkCyber commented on 2024-11-29
elvircrn force pushed 1 year ago
elvircrn marked this pull request as ready for review 1 year ago
elvircrn
elvircrn force pushed 1 year ago
elvircrn force pushed 1 year ago
elvircrn force pushed 1 year ago
elvircrn force pushed 1 year ago
MekkCyber
elvircrn force pushed 1 year ago
elvircrn
elvircrn force pushed 1 year ago
MekkCyber
MekkCyber
MekkCyber commented on 2024-12-09
elvircrn force pushed 1 year ago
elvircrn
MekkCyber
MekkCyber
MekkCyber commented on 2024-12-09
elvircrn force pushed 1 year ago
elvircrn
MekkCyber requested a review from SunMarc 1 year ago
elvircrn
elvircrn
elvircrn force pushed 1 year ago
elvircrn force pushed 1 year ago
SunMarc
SunMarc approved these changes on 2024-12-13
elvircrn force pushed 1 year ago
elvircrn
elvircrn force pushed 1 year ago
elvircrn
elvircrn force pushed 1 year ago
MekkCyber requested a review from ArthurZucker 1 year ago
MekkCyber
elvircrn force pushed 353 days ago
elvircrn
elvircrn requested a review from ydshieh 340 days ago
elvircrn requested a review from Rocketknight1 340 days ago
elvircrn requested a review from muellerzr 340 days ago
elvircrn requested a review from stevhliu 340 days ago
stevhliu
stevhliu commented on 2025-01-09
elvircrn
elvircrn
elvircrn Resolve vptq conflict
7f4aa051
elvircrn Rename spqr package to spqr_quant
ac4b1426
elvircrn Get rid of aqlm mention
8c3f5f16
elvircrn Start working on tests
ff61b8e9
elvircrn Resolve ruff code checks
23c3a24a
elvircrn Ruff format
0cb5ba7a
elvircrn Isort
c980e667
elvircrn Test updates
163983b9
elvircrn Add gpu tag
f51d3d10
elvircrn Rename to modules_to_not_convert
24ca92f4
elvircrn Config update
913fbcb3
elvircrn Docs and config update
d4331652
elvircrn Docs and config update
5582beb1
elvircrn Update to update_torch_dtype
3d64f881
elvircrn spqr config parameter validation
81237de9
elvircrn Ruff update
1dacd501
elvircrn Apply ruff fixes
c1a4304f
elvircrn Test fixes
53c53c06
elvircrn Ruff update
c21c4129
elvircrn Mark tests as @slow again; Ruff; Docstring update
dc89200e
elvircrn Ruff
64929f7e
elvircrn Remove absolute path
fada970a
elvircrn Resolve typo
4694339a
elvircrn Remove redundandt log
92ea4934
elvircrn Check accelerate/spqr availability
14944538
elvircrn Ruff fix
9e8f4707
elvircrn Check if the config contains proper shapes
525dcdfe
elvircrn Ruff test
1a54d862
elvircrn Documentation update
68afc893
elvircrn overview update
0eff9441
elvircrn Ruff checks
82e7f4e7
elvircrn Ruff code quality
274d3683
elvircrn Make style
a630d6d2
elvircrn Update docs/source/en/quantization/spqr.md
96b2613f
elvircrn Update spqr.md
17d1c72a
elvircrn force pushed to 17d1c72a 334 days ago
SunMarc
SunMarc
elvircrn
jiqing-feng Enable gptqmodel (#35012)
55b50c70
MekkCyber Fix : Nemotron Processor in GGUF conversion (#35708)
c4273e27
elvircrn force pushed from 64d20397 to c4273e27 311 days ago
elvircrn Merge branch 'main' into spqr-quantizer
5aac5e3e
elvircrn
ArthurZucker
ArthurZucker approved these changes on 2025-01-16
ArthurZucker
elvircrn Update docs/source/en/quantization/spqr.md
95d2e743
elvircrn Add missing TOC to doc
3fdc0c38
elvircrn
elvircrn Merge branch 'huggingface:main' into spqr-quantizer
14f21c1b
elvircrn force pushed from 14f21c1b to 03401523 305 days ago
elvircrn force pushed from 03401523 to 14f21c1b 305 days ago
elvircrn
MekkCyber
elvircrn Merge branch 'main' into spqr-quantizer
cdefeafe
elvircrn
elvircrn
MekkCyber Merge branch 'main' into spqr-quantizer
8da4a66e
MekkCyber
elvircrn
MekkCyber Merge branch 'main' into spqr-quantizer
afff70ea
elvircrn
SunMarc
SunMarc approved these changes on 2025-02-13
MekkCyber merged 845b0a26 into main 305 days ago
elvircrn
HuggingFaceDocBuilderDev

