Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
intel/auto-round
Pull Requests
Commits
hengguo/refactor_algs
AutoAdamRound_bugfix
Chinesization
ZaneMark-patch-1
ZaneMark-patch-3
acp
actvation_quant
add_task_args_for_lmeval
ar_agent
ark_v0.13.4
ark_zp
autoround_support_qbits_backend
bf16_scale
chore/claude-init
copilot/add-xpu-moe-decode-implementation
copilot/convert-script-improvements
copilot/fix-corner-case-in-auto-round
copilot/fix-deprecated-fp-layers-handling
copilot/fix-docstrings-in-python-files
copilot/fix-issue-with-auto-rounding
copilot/fix-llm-type-70b-bits-setting
copilot/fix-typeerror-wrapped-fn
copilot/fix-vllm-model-inference-issue
copilot/improve-pr-template-type-of-change
copilot/investigate-quantization-group-and-ffn
copilot/replace-getset-module-torch-api
copilot/sageattention
copilot/speedup-fp8-linear-convert
copilot/speedup-fp8-linear-convert-again
copilot/speedup-fp8-linear-convert-another-one
copilot/sub-pr-1237-again
copilot/sub-pr-1237
copilot/sub-pr-1324
copilot/sub-pr-1522-again
copilot/sub-pr-1532
copilot/update-user-settings-page
copilot/vscode-mo3shmf8-8qa6
ddp
debug_time_cost
debug/usable_rotation
debug-hang
debug-nvfp4
deepseekv3
ds-qwen
ds-v5
ds-v32
dsv4
enable_glm4_moe_lite_quantization
enable_llama4_int8_baseline
enable_llama4_quant
enable_mxfp_exporting
feat/activation-checkpointing
feat/ark-xpu-int3-woq-gemm
feat/ark-xpu-int3-woq-gemv
feat/autoround-quarot
feature/overlap_for_nblocks
fix/ark-pack-weight-validation
fix_bug0627
fix_bug_0722
fix_bug_1105
fix_compile
fix_disable_act_dynamic_usage_in_mxfp.py
fix_dq
fix/fp-layers-deprecation-mapping
fix_gemma3_issue
fix_gguf_fp8
fix_gptqmodel
fix_low_cpu
fix_rotation
fix_save_quantized_func_nvfp_checker
fix_0107
fix_0109
fix_0113
fix-attn-mask-b60
fix-ds
fix-flashinfer
fix-gpt-oss
fix-hpu
fix-to-meta-assertion-error-1499
fixbug_0717
fp4_v2
fp4_v3
fp8-cache
fp8-cache-based-export
fp8-static-quant-patch
fp8_export_backup_stable
fp8_export_for_test
good-flux
hengguo/fix_cuda_ut
hengguo/fix_gguf_ds
hengguo/gguf_fix_615
hengguo/quantizers
hengguo/refactor_algs
hengguo/refactor_init
hengguo/refactor_quant_step1
hengguo/smoothquant
hengguo/w4afp8_sim
henguo/update_so
hpu_only_kg
hpu_only_pkg
hpu/only/v1
hpu-limit-tran
kaihui/torch_dtype
lazy-model-replace
leq_opub
lib/pre-4.4.0
llama/new/9-610
llama/new/9
llm-main
llmc
llmc-backup
llmc-test
lm-head-quant
load-kv
load-w8a8-replace-mod
load-w8a8
lvl/autoscheme_ram_opt
lvl/cpu_ram_optimization
lvl/fix_no_init_weights
lvl/general_moe_replacement
lvl/support_diffusiongemma
lvl/support_fp8_with_ark
lvl/support_hunyuan_image
lvl/support_turbo_quant
lyt/numpy_fix
lyt/omni
main
marlin_modify
mengni/bug_fix
mengni/expert
mengni/mengni/block_wise
mengni/new_vllm
mengni/vllm
mengni/vlm
mengniwang95-patch-1
more-ar-ext
mxfp8
origin/block_wise
patch/for/ao/581/stable
patch-for-ao-2
pr1775-followup
pre-release/internal-inc/w4a8
quant-attn-hpu
quant-attn-hpu-o-scale
quant-attn-hpu-pr
quant-llama
quarot-llama
qwen3-vl
qwen3_vl_moe
qwen-split
qwen-v5
refine_device_tmp
refine_device
refine-doc-table
replace-lm-head
revert_order
revert-318-fix/hpu/check
revert-1231-set_disable_opt_rtn_default_2_none
revert-1562-suyue/ut
save_memory
set_disable_opt_rtn_default_2_none
sparse-attn
sparse-attn-clean
sparse-attn-v0
static_quant
support_gemma4
suyue/ci
test-git
try_new_optimizer
try_to_fix_hadamard_regression
update_fp_compile
update_rm
update_0522
update_0819
upstream-ao
use-ep
ut-time
v0.7.0rc
v0.7.1rc
v0.8.0rc
v0.8.0rc2
v0.9.1rc
v0.9.2-release
v0.9.2rc
v0.9.3rc
v0.9.4rc
v0.9.5rc
v0.9.6rc
v0.9.7rc
v0.10.0rc
v0.10.1rc
v0.10.2rc
v0.10.3rc
v0.12.0rc
v0.12.1rc
v0.12.2rc
v0.12.3rc
v0.13.0rc
v0.13.1rc
vllm-sharing-deck-2026
w4a4_int_quaro
w4int8dynamic
wangchang/fix_oom
wangchang/vllm
weiwei/mxfp_exp
wfp8-afp8-bk
xin3he-patch-1
xinhe/3-20c
xinhe/3-27c
xinhe/3-27d
xinhe/4-7
xinhe/4-15
xinhe/5-11
xinhe/6-17
zhenzhong/toolkit_release
Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto-round into hengguo/refactor_algs
n1ck-guo
committed
9 days ago
6f219c60
performance
n1ck-guo
committed
9 days ago
3fecf0bc
performance
n1ck-guo
committed
9 days ago
12814d5f
add type annotation
n1ck-guo
committed
10 days ago
241cb393
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo
committed
10 days ago
8c731933
fix awq cuda CI (#1912)
WeiweiZhang1
committed
10 days ago
be67ef3a
fix inplace rotation issue (#1903)
wenhuach21
committed
11 days ago
d13b1ddd
[ARK] update README (#1906)
luoyu-intel
committed
11 days ago
0068522d
fallback compute type on b70 if needed (#1904)
yiliu30
committed
11 days ago
83cbe978
fix: guard zero-division in GGUF quant kernels to avoid NaN block scales (#1909)
Entrpi
committed
11 days ago
d6153cb7
fix gguf opt-rtn regression (#1905)
wenhuach21
committed
11 days ago
bfa795ca
update llama-cpp-python installation for CUDA CI (#1907)
XuehaoSun
committed
11 days ago
fb9a772f
feat: improve review-pr skill score from 76% to 90% (#1901)
yogesh-tessl
committed
12 days ago
205e5f60
Fix slow startup time of pytest coverage for unit tests (#1899)
XuehaoSun
committed
12 days ago
2890673c
fix
n1ck-guo
committed
12 days ago
eac3ab8d
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo
committed
12 days ago
2a12b197
feat: add MXFP4/MXFP8 quantization support (llmc_compressor format) and related tests (#1865)
xin3he
committed
12 days ago
6cdb2a20
Fix CI coverage & bug grep issue (#1893)
chensuyue
committed
12 days ago
5ed21d3e
[step 1]refine code to support all devices in torch and hot fix for gemma4-unified (#1879)
wenhuach21
committed
13 days ago
2794a6e2
Update auto-round-lib release package build (#1895)
chensuyue
committed
13 days ago
0de1eb05
fix random rotation and update rotation doc. (#1884)
lkk12014402
committed
13 days ago
6afd14c2
change num_samples to property
n1ck-guo
committed
13 days ago
30f1dd05
refactor pipeline
n1ck-guo
committed
13 days ago
7fb0839e
clean and update
n1ck-guo
committed
14 days ago
9edf0d03
Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto-round into hengguo/refactor_algs
n1ck-guo
committed
17 days ago
fde14a58
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo
committed
17 days ago
42d77710
Merge branch 'main' into hengguo/refactor_algs
n1ck-guo
committed
17 days ago
2638c1c7
fix
n1ck-guo
committed
18 days ago
5a968849
fix performance regression (#1886)
wenhuach21
committed
18 days ago
9f254fbe
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo
committed
18 days ago
4fc9724a
Older