intel/auto-round
Open Pull Requests
fix: replace bare except clauses with except Exception
#1981 opened 2026-07-01 22:41 by
ramkrishs
Support vLLM-based Model Quantization with llm_compressor Export
#1978 opened 2026-07-01 14:49 by
changwangss
Add hpu inference in CI test
#1976 opened 2026-07-01 08:39 by
chensuyue
[ARK] Support gemm using sycl-tla
#1968 opened 2026-06-30 09:15 by
Zhenzhong1
perf: back-pointer DP for AutoScheme bit allocation to cut path-copy RAM
#1959 opened 2026-06-26 21:34 by
SuperMarioYL
0.15.0
feat: add --dry-run VRAM/size estimation mode
#1958 opened 2026-06-26 19:37 by
mvanhorn
Fix UltraChat chat-template handling for Transformers v5
#1941 opened 2026-06-22 06:43 by
Copilot
Add quantization support for DiffusionGemma
#1935 opened 2026-06-17 13:59 by
lvliang-intel
Added prefill strategy benchmarking script and results
#1923 opened 2026-06-15 07:31 by
jijiaz
[draft] refine device
#1900 opened 2026-06-09 07:40 by
wenhuach21
feat: add overlap function for multi-blocks compression
#1850 opened 2026-05-25 06:18 by
ZaneMark
Add moe prefill/decode with int2/int4/int8 sym/asym and fp8 e4m3 e5m2
#1813 opened 2026-05-14 04:27 by
Copilot
feat: support Nemotron-H / Nemotron-Cascade-2 (#1711)
#1712 opened 2026-04-20 18:41 by
michael-rabe
Continuously optimize AutoScheme RAM consumption
#1703 opened 2026-04-17 05:43 by
lvliang-intel